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VERTEBRATE EMBRYONIC PAHERN-INDUCING HEDGEHOG-LIKE PROTEINS. 
Background of the Invention 

Pattern fonnation is the activity by which emteyonic cells fonn ordered spatial 
arrangements of dififerentiated tissues. The physical complexity of higher organisms arises 
during emhiyogenesis through the interplay of cell-intrinsic lineage and cell-extrinsic 
signaling. Inductive interactions are essential to embryonic patterning in vertebrate 
development fiom the earliest establishment of the body plan, to the pa*»«ni!i£ of the organ 
systems, to the generation of diverse cell types during tissue differentiation (Davidson, E , 
(1990) Development 108: 365-389; Gurdon, J. B., (1992) 0/768: 185-199; Jessell, T. M. et 
al., (1992) Cell 68: 257-270). The effects of developmental cell interactions are varied. 
Typically, responding cells are diverted from one route of cell differentiation to another by 
inducing cells that differ from both the uninduced and induced states of the responding cells 
(inductions). Sometimes cells induce their neighbors to differentiate like themselves 
(homoiogenetic induction); in other cases a ceU inhibits its neighbors from differentiating 
like itself Cell mteractions in early development may be sequential, such that an initial 
induction between two cell types leads to a progressive amplification of diversity. Moreover, 
inductive interactions occur not only in embryos, but in adult cells as well, and can act to 
establish and maintain morphogenetic patterns as weU as induce differentiation (J.B. Guidon 
(1992) Cc/Z 68:185-199). 

The origin of the nervous system in all vertebrates can be traced to the end of 
gastrulation. At this time, the ectoderm in the dorsal side of the embryo changes its &te fiom 
epidermal to neural. The newly formed neuroectodemi fhickeiis to form a flattened structure 
called the neural plate which is characterized, in some vertebrates, by a central groove (neural 
groove) and thickened lateral edges (neural folds). At its early stages of differentiation, the 
neural plate already exhibits signs of regional differentiation along its anterior posterior (A-P) 
and mediolateral axis (M-L). The neural folds eventually fuse at the doreal midline to form 
the neural tube which will differentiate into brain at its anterior end and spinal cord at its 
posterior end. Closure of the neural tobe creates dorsal/ventral differences by virtue of 
previous mediolateral differentiation. Thus, at the end of neurulation, the neural tube has a 
clear anterior-posterior (A-P), dorsal ventral (D-V) and mediolateral (M-L) polarities (see, for 
example. Principles in Neural Science (3rd), eds. Kandel, Schwartz and Jessell, Elsevier 
Science Publbhing Company: NY, 1991; and Developmental Biology (3rd), ed. S.F. GUbert, 
Sinauer Associates: Sunderland MA, 1991). Inductive interactions that define tiie fate of 
cells within the neural tube establish the initial pattern of the embryonic vertebrate novous 
system. In the spinal cord, the identify of cell types is controlled, in part, by signals fiom two 
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midline cell groups, the notochord and floor plate, that induce neural plate cells to 
differentiate into floor plate, motor neurons, and other ventral neuronal types (van Straaten et 
al. (1988) Anat. Embryol. 177:317-324; Placzek et al. (1993) Development 117:205-218; 
Yamada et al. (1991) Cell 64:035-647; and Hatta et al. (1991) Nature 350:339-341). Iii 
addition, signals from the floor plate are responsible for the orientation and direction of 
commissural neuron outgrowth (Placzek, M. et al., (1990) Development 1 10: 19-30). Besides 
patterning the neural tube, the notochord and flooiplate are also responsible for producing 
signals which control the patterning of the somites by inhibiting differentiation of dorsal 
somite derivatives in the ventral regions (Brand-Saberi, B. et al., (1993) Anat. Embryol 188: 
239-245; Porquie, 0. et al., (1993) Proc. Natl Acad. ScL USA 90: 5242-5246). 

Another important signaling center exists in the posterior mesenchyme of developing 
limb buds, caUed the Zone of Polarizing Activi^, or "ZPA". When tissue from the posterior 
region of the Ihnb bud is grafted to the anterior border of a second limb bud, the resultant 
limb will develop with additional digits in a nmror-iniage sequence along the anteroposterior 
axis (Saunders and Gasselmg, (1968) Epithelial-Mesenchymal Interaction, pp. 78-97). This 
finding has led to the model that the ZPA is responsible for normal anteroposterior pattMning 
in the limb. The ZPA has been hypothesized to function by releasing a signal, termed a 
"morphogen", which forms a gradient across the early embryonic bud. According to this 
model, the fate of cells at different distances from the ZPA is determined by the local 
concentration of the morphogen, with specific thresholds of the morphogen inducmg 
successive structures (Wolpert, (1969) Theor. Biol 25:1-47). This is supported by the 
finding that the extent of digit duplication is proportional to the number of implanted ZPA 
cells (Tickle, (1981) Nature 254:199-202). 

A candidate for the putative ZPA morphogen was identified by the discovery that a 
source of retinoic acid can result in the same type of mirror-image digit diq)Iications when 
placed in the anterior of a limb bud (Tickle et al., (1982) Nature 296:564-565; SummerbeU, 
(1983) J. Embryol 78:269-289). The response to exogenous retinoic acid is concentration 
dependent as tiie morphogen model demands (Tickle et al., (1985) Dev. Biol 109:82-95). 
Moreover, a differential distribution of retinoic acid exists across the limb bud, with a higher 
concentration in the ZPA region (Thaller and Eichele, (1987) Nature 327:625-628). 

Recent evidence, however, has indicated that retinoic add is unlikely to be the 
endogenous fiwtor reqjonsible for ZPA activity (reviewed in Brookes, (1991) Nature 350:15; 
Tabin, (1991) Cell 66:199-217). It is now believed that rather than directly mimicking an 
endogenous signal, retinoic acid implants act by inducing an ectopic ZPA. The anterior limb 
tissue just distal to a retinoic acid implant and directly under the ectoderm has been 
demonstrated to acquire ZPA activity by serially transphmting that tissue to another lunb bud 
(SummerbeU and Harvey, (1983) Limb Development and Regeneration pp. 109-1 18; Wandc 
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et al., (1991) mture 350:81-83). Conversely, the tissue next to a ZPA graft does not gain 
ZPA activity (Smith. (1979) 1 Embryol 52:105-113). Exogenous retinoic acid would thus 
appear to act upstream of the ZPA in limb patterning. 

The immediate downstream targets of ZPA action are not known. However, one 
important set of genes which are ectopically activated during ZPA-induced pattern 
duplications are the 5' genes of the Hoxd cluster. These genes are noimally expressed in a 
nested pattern emanating fiom the posterior margin of the limb bud (DoUe et al., (1989) 
Nature 342:767-772; Izpisua-Behnonte et al., (1991) Nature 350:585-589). This nested 
pattern of Hox gene expression has been directly demonstrated to determine the identity of 
the structures produced along the anteroposterior axis of the limb (Morgan et al., (1993) 
Nature 358:236-239). As this would predict, ZPA grafts which produce mirror-image 
diq)lication of structures at an anatomical level first lead to the ectopic activation of the Hoxd 
genes in a mirror-image duplication at the molecular level. (Nohno et al., (1991) Cell 
64:1 197-1205; Izpisua-Behnonte et al., (1991) Nature 350:585-589). The molecuhir signals 
which regulate Ae expression of these inq)ortant genes are currently not understood. 

Summary of the InvetiHtm 

The present invention relates to the discovery of a novel femily of proteins present in 
vertebrate organisms, referred to hereinafter as "hedgehog" proteins, which proteins have 
apparent broad involvement in the formation and maintenance of ordered spatial 
arrangements of di£feraitiated tissues in vertebrates, and can be used to generate and/or 
maintain an anray of different vertebrate tissue both in vitro and in vivo. 

In general, the invention features hedgehog polypeptides, preferably substantially 
pure preparations of one or more of the subject hedgehog polypeptides. The invention also 
pro'/ides recombinantly produced hedgehog polypeptides. In preferred embodiments the 
polypeptide has a biological activity including: an ability to modulate proliferation, survival 
and/or differentiation of mesodermally-derived tissue, such as tissue derived from dorsal 
mesoderm; ihc ability to modulate proliferation, survival and/or differentiation of 
ectodeimally-derived tissue, such as tissue derived from ibe neural tube, neural crest, or head 
mesenchyme; the ability to modulate prolifisration, survival and/or differentiatioii of 
endodermally-derived tissue, sudi as tissue derived fix>m the primitive gut. Moreover, in 
preferred embodiments, the subject hedgehog proteins have the ability to induce expression 
of secondary signaUng molecules, such as members of the Transformmg Growth Factor p 
fiunily, as well as members of the fibroblast growth factor (FOF) family. 
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In a certain embodiments, the polypeptide is identical with or homologous to a Sonic 
hedgehog (Shh) polypeptide, such as a mammalian Shh represented by SEQ ID Nos:13 or 1 1, 
an avian Shh represented by SEQ ID No: 8, or a fish SWr represented by SEQ ID No: 12. For 
instance, the Shh polypeptide preferably has an amino acid sequence at least 70% 
homologous to a polypeptide represented by any of SEQ ID Nos: 8, 11, 12 or 13, though 
polypeptides with higher sequence homologies of, for example, 80%, 90% or 95% are also 
contemplated. Exemplary Shh proteins are represented by SEQ ID No. 40. The Shh 
polypeptide can comprise a full length protein, such as represented in the sequence listings, 
or it can comprise a fragment of, for instance, at least 5, 10, 20, 50, 100 or 150 ammo acids in 
length. Preferred hedgehog polypeptides include Shh sequences corresponding 
approximately to the natural proteolytic fragments of the hedgehog proteins, such as from 
about Cys-24 through Glu-188, or from about Asn-189 through Ala-475 of the human Shh 
protein, or analogous fiiagments thereto. 

In other embodiments, the polypeptide is identical with or homologous to an Indian 
hedgehog (Ihh) polypeptide, such as a human Jhh represented by SEQ ID No: 14, or a mouse 
i»A represented by SEQ ID No: 10. For instance, the Ihh polypeptide piefaably has an 
amino acid sequence at least 70% homologous to a polypq>tide represented by either of SEQ 
ID Nos: 10 or 14, though Ihh polypeptides with higher sequence homologies of, for exan^le, 
80%, 90% or 95% are also contemplated. The polypeptide can comprise the foil length 
protein represented by in part by these sequences, or it can comprise a fragment of, for 
instance, at least 5, 10, 20, 50, 100 or 150 amino acids in length. Preferred /Wj polypeptides 
comprise an N-termmal fragment including Arg-1 through Glu-94, or a C-teiminal fragment 
including His-95 through Ser-3312 of the human Ihh represented by SEQ'iD No: 14, or 
analogous fiiagments thereto. 

In still fiirther embodiments, the polypeptide is identical with or homologous to a 
Desert hedgehog (Dhh) polypeptide, such as a mouse Dhh represented by SEQ ID No: 9. For 
mstance, the Dhh polypeptide preferably has an amino acid sequence at least 70% 
homologous to a polypeptide represented by SEQ ID No: 9, though Dhh polypeptides with 
higher sequence homologies of, for example, 80%, 90% or 95% are also contemplated. The 
polypeptide can comprise the fiill length protem represented by this sequence, or it can 
comprise a fragment of, for instance, at least 5, 10, 20, 50, 100 or 150 amino acids in length. 
Preferred Dhh polypeptides comprise Dhh sequences corresponding to the N-terminal portion 
of the protein, e.g. Cys-23 through Asp-189 or Asn-190 through Gly-396 of SEQ ID No: 9, 
or analogous fragments thereto. 

Moreover, as described below, the hedgehog polypeptide can be either an agonist 
(e.g. mimics), or alternatively, an antagonist of a biological activity of a naturally occurring 
form of the protein, e.g., the polypeptide is able to modulate differentiation and/or growth 
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and/or survival of a cell responsive to authentic hedgehog proteins. Homologs of the subject 
hedgehog proteins include versions of the protein which are resistant to proteolytic cleavage, 
as for example, due to mutations which alter potential cleavage sequences or which inactivate 
« an enzymatic activity associated with the protein. 

5 The hedgehog polypeptides of the present mvention can be glycosylated, or 

• conversely, by choice of the ©cpression system or by modification of the protein sequence to 

preclude giycosylation, reduced carbohydrate analogs can also be provided. Glycosylated 
forms include derivatization with glycosaminoglycan chains. Likewise, hedgehog 
polypeptides can be generated ^^ch lack an endogenous signal sequence (though this is 
1 0 typically cleaved off even if present in the pro-form of the protein). 

The subject proteins can also be provided as chimeric molecules, such as in the form 
of fusion proteins. For instance, the hedgehog protein can be provided as a recombinant 
fusion protein which includes a second polypeptide portion, e.g., a second polypeptide 
having an amino acid sequence unrelated to hedgehogs e,g. the second polypeptide portion is 
1 5 glutathione-S-transferase, e.g. the second polypeptide portion is an enzymatic activity such as 
alkaline phosphatase, e.g. the second polypeptide portion is an epitope tag. 

Yet another aspect of the preset invention concerns an immunogen comprising a 
hedgehog polypeptide in an immunogenic preparation, the immunogen bemg capable of 
eliciting an immune response specific for a hedgehog polypq)tide; e.g. a humoral response, 
20 e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen 
comprising an antigenic determinant, e«g« a imique determinant, firom a protein represented 
by one of SEQ ID Nos. 8-14. 

A still further aspect of the present invention features antibodies and antibody 
preparations specifically reactive with an epitope of the hedgehog immunogra. 

25 Another aspect of the present mvention provides a substantially isolated nucleic acid 

having a nucleotide sequence which encodes a hedgehog polypeptide. In preferred 
embodiments, the encoded polypeptide ^ecifically agonizes or antagonizes mductive events 
mediated by wild-type hedgehog proteins. The coding sequence of the nucleic acid can 
comprise a sequatice which is identical to a codmg sequence represented in one of SEQ ID 
^ 30 Nos: 1-7, or it can merely be homologous to one or more of those sequences. For instance, 
the hedgehog encoding sequence preferably has a sequence at least 70% homologous to a 
nucleotide sequence in one or more of SEQ ID Nos: 1 -7, though higher sequence homologies 
of, for example, 80%, 90% or 95% are also contemplated. The polypeptide encoded by the 
nucleic acid can comprise an amino acid sequence represented in one of SEQ ID Nos: 8-14 

35 such as one of those full length proteins, or it can comprise a fiagment of &at nucleic acid, 
which firagment may, for instance, encode a firagment which is, for example, at least 5, 1 0, 20, 



wo 95/18856 PCTAJS94/14992 

50 or 100 amino acids in length. The polypeptide encoded by the nucleic acid can be either 
an agonist (e.g. mimics), or alternatively, an antagonist of a biological activity of a naturally 
occurring form of a hedgehog protein. 

Furthermore, in certain preferred embodiments, the subject hedgehog nucleic acid 
S will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional 
promoter or transcriptional enhancer sequence, \^ch regulatory sequence is operably linked 
to the hedgehog gene sequence. Such regulatory sequences can be used in to render the 
hedgehog gene sequence suitable for use as an expression vector. 

In yet a further preferred embodiment, the nucleic acid hybridizes imder stringent 
10 conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of one 
or more of SEQ ID Nos: 1-7; though preferably corresponding to at least 20 consecutive 
nucleotides; and more preferably corresponding to at least 40, SO or 75 consecutive 
nucleotides of one or more of SEQ ID Nos: 1-7. 

The invention also features transgenic non-human animals, e.g. mice, rats, xabbits, 
15 chickens, fix)gs or pigs, having a transgene, e;g., animals which include (and preferably 
express) a heterologous form of a hedgehog gene described herein, or which misexpress an 
endogenous hedgehog gene, e.g., an animal in which expression of one or more of the subject 
hedgehog proteins is disnq)ted. Such a transgenic animal can serve as an animal model for 
studying cellular and tissue disorders comprising inulated or mis-dressed hedgehog alleles 
20 orforuseindrugscreenmg. 

The invention also provides a probe^rimCT comprising a substantially purified 
oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence 
which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or 
antisense sequence of SEQ ID No: 1, or naturally occurring mutants hereof. Nucleic acid 

25 probes which are specific for each of the classes of vertebrate hedgehog proteins are 
contemplated by the present invention, e.g. probes which can discmi between nucleic acid 
encoding an Shh versus an Ihh versus a Dhh versus an Mhh. In inferred embodimrats, the. 
probe/primer fiuther includes a label group attached thereto and able to be detected. The 
label group can be selected, e.g., torn a group consisting of radioisotopes, fluorescent 

30 compounds, enzymes, and enzyme co-fiEu:tors. Probes of the invention can be used as a part 
of a diagnostic test kit for identifying dysfunctions associated with mis-expression of a 
hedgehog protein, such as for detecting in a sample of cells isolated fix)m a patient, a level of 
a nucleic acid encoding a subject hedgehog protein; e.g. measuriiig a hedgehog mRNA level 
in a cell, or determining whether a gmomic hedgehog gene has been mutated or deleted. 

35 Preferably, the oligonucleotide is at least 10 nucleotides in length, though primers of 20, 30, 
50, 100, or 150 nucleotides in length are also contemplated. 
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In yet another aspect, the invention provides an assay for screening test compounds 
for inhibitors, or alternatively, potentiators, of an interaction between a hedgehog protein and 
a hedgehog receptor. An exemplary method includes the steps of (i) combining a hedgehog 
receptor, either soluble or membrane bound (including whole cells), a hedgehog polypeptide, 
5 and a test compound, e.g., under conditions wherein, but for the test compound, the hedgehog 
protein and the hedgehog receptor are able to interact; and (ii) detecting the formation of a 
complex which includes the hedgehog protein and the receptor either by directly quantitating 
the complex or by measuring inductive effects of the hedgehog protein. A statistically 
significant change, such as a decrease, in the formation of the complex in the presence of a 
1 0 test compound (relative to what is seen in the absence of the test compound) is indicative of a 
modulation, e.g., inhibition, of the interaction between the hedgehog protein and the receptor. 

Another a^ect of the prexsA invention relates to a mefbod of inducing and/or 
maintaining a differentiated state, causing proliferation, and/or enhancing survival of a cell 
(fix)m a vertebrate organism) responsive to a hedgehog protein, by contacting the cells with a 

15 hedgehog a.gomst For example, the present method is applicable to cell culture technique, 
such as in the culturing of neuronal and other cells whose survival or differentiative state is 
dependent on hedgehog function. Moreover, hedgehog agonists and antagonists can be used 
for therapeutic intervention, such as to enhance survival and maintenance of neurons and 
other neural cells in both the central nervous system and the peripheral nervous system, as 

20 well as to influence other vertebrate organogenic pathways, such as other ectodermal 
patterning, as well as certain mesodermal and endodermal differentiation processes. In 
addition to the vertebrate hedgehog-]iks proteins, the present invention further contemplates 
the use of Drosophila Hedgehog (Dros-HH) to induce cells and tissue of vertebrate organisms 
in similar fashion to the subject hedgehog proteins. 

25 Another aspect of the present invention provides a method of determining if a subject, 

e.g. a human patient, is at risk for a disorder charac^rized by unwanted cell proliferation or 
aberrant control of differentiation. The method includes detecting, in a tissue of the subject, 
the presence or absence of a genetic lesion characterized by at least one of (i) a mutation of a 
gene encoding a hedgehog protein, e.g. represented in SEQ ID No: 2, or a homolog thereof; 

30 or (u) the mis-expression of a hedgehog gene. In preferred embodiments, detecting the 
genetic lesion includes ascertaining the existence of at least one of: a deletion of one or more 
nucleotides fi-om a hedgehog gene; an addition of one or more nucleotides to the gene, a 
substitution of one or more nucleotides of the gene, a gross chromosomal rearrangement of 
the gene; an alteration in the level of a messenger RNA transcript of the gene; the presence of 

35 a non-wild type splicing pattern of a messenger RNA transcript of the gene; or a non-wild 
type level of the protein. 
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For example, detecting the genetic lesion can include (i) providing a probe/primer 
including an oligonucleotide containing a region of nucleotide sequence which hybridizes to 
a sense or antisense sequence of a hedgehog gene, e.g. a nucleic acid represented in one of 
SEQ ID Nos: 1-7, or naturally occurring mutants thereof, or 5* or 3' flanking sequences 
5 naturally associated with the hedgehog gene; (ii) exposing the probe/primer to niicleic acid of 
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the 
presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises 
utilizing the probe/primer to determine the nucleotide sequence of the hedgehog gene and, 
optionally, of the flanking nucleic acid sequences. For instance, the probe/primer can be. 
10 employed in a polymerase chain reaction (PGR) or in a ligation chain reaction (LCR). In 
alternate embodiments, the level of a hedgehog protein is detected in an immunoassay using 
an antibody which is specifically immunoreactive with the hedgehog protein. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, 

15 microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explamed fully in the literature. See, for example. Molecular Cloning A 
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: 1989); DNA Cloning, Volumes I and 11 (D. N. Glover ed., 1985); 
Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; 

20 Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 
Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. 
Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B, 
Perbal, A Practical Guide To Molecular Cloning (1984); the treatise. Methods In Enzymology 
(Academic Press, Inc., N. Y,); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and 

25 M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 
and 155 (Wu et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, 
Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

30 Other features and advantages of the invention will be apparent from the following 

detailed descrq>tion, and from the claims. 

Brief Description of the Drawings 

Figures 1 represents the amino acid sequences of two chick hh clones, chicken 
35 hedgehog'A (pCHA; SEQ ID No:35) and chicken hedgehog-B (pCHB; SEQ ID No:36). 
These clones were obtained using degenerate primers corresponding to the underlined amino 
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acid residues of the Drosophila sequence (corresponding to residues 161-232 of SEQ ID 
No:34) also shown in Figure 1, followed by nested PGR using chicken genomic DNA. 

Figure 2 is an alignment comparing the amino acid sequences of chick Shh (SEQ ID 
No:8) with its Drosophila homolog (SEQ ID No:34). Shh residues 1-26 correspond to the 
proposed signal peptide. Identical residues are enclosed by boxes and gaps in order to 
highlight sunilarity. The nucleotide sequence of Shh has been submitted to Gmbank. 

Figure 3 is a hydropathy plot for the predicted chick Shh protein, generated by the 
methods of Kyte and Doolittle (1982). The values of hydrophobicity are plotted against the 
amino acid positions. Negative values predict a hydrophobic domain of the protein. 

Figure 4 is an alignment comparing the amino acid sequences of various hh proteins. 
The white region on the amino terminus of chicken Shh corresponds to the putative signal 
peptide. The black box refers to a highly conserved region from aa residues 26-207 of SEQ 
ID No:8). The arrows point to exon boundaries in the Drosophila gene (Lee et ai. (1992) Cell 
71: 33-50). In each case, the proteins are compared to chicken Shh (SEQ ID No:8) and the 
percent amino acid identity is mdicated in each region's box. 

Figure 5A is a "pileup" alignment of predicted amino acid sequences which compares 
Drosophila hh (D-hh; SEQ ID No:34), mouse hh QA-Dhh; SEQ ID No:9; M-Ihh; SEQ ID 
No:10; M-Shh; SEQ ID No:l 1), chicken hh (C^h; SEQ ID No:8X and zebrafish hh (Z-Shh; 
SEQ ID No: 12). The predicted hydrophobic transmembrane/signal sequences are indicated 
in italics and the predicted signal sequence processing site is arrowed. The positions of 
introns interrupting ibs Drosophila hh and M-D/r/r open reading frames are indicated by 
arrowheads. All amino acids shared among the six predicted Mt proteins are indicated in 
bold. Figure SB is a sequence alignment of the N-terminal portion of vertebrate hedgehog 
proteins, and the predicted degenerate sequence "CON" (SEQ ID No: 41). 

Figure 6 is an inter- and cross-^ecies comparison of amino acid identities among the 
predicted processed hh proteins shown in Figure 5A. All values are percentages. Figures in 
parentheses represent similarities allowing for conservative amino acid substitutions. 

Figure 7 is a representation of the DNA constructs used in transgenic studies to study 
ectopic expression of chick Shh in mouse embryos. Constructs were generated for ectopic 
expression of cDNA clones in the Wnt-l expression domain and tested in transgenic mice 
embryos using a lac-Z reporter (pWEXP-lacZ (used as a control)) and a chick Shh reporter 
(pWEXP-CShh). The pWEXP-CShh construct contained two tandem head to tail copies of a 
chick Shh cDN A. The results of WEXP2-CShh transgenic studies are shown in Table 1 • 

Figure 8 is a model for anterioposterior limb patterning and the Zone of Polarizing 
Activity (ZPA), based on Saunders and Gasseling (1968). The left portion of the diagram 
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schematizes a stage 20 limb bud. The somites are iUiistrated as blocks along the left margin 
of the limb bud; right portion of the same panel illustrates the mature wing. The hatched 
region on the posterior limb is the ZPA. Normally, the developed wing contains three digits 
n, III, and IV. The figure further shows the result of transplanting a ZPA from one limb bud 
5 to the anterior margin of another. The mature limb now contains six digits IV, HI, II, II, III, 
and rv in a mirror-image duplication of the normal pattern. The large arrows in both panels 
represent the signal produced by the ZPA which acts to specify digit identity. 

Figures 9A and 9B illustrate the comparison of zebrafish Shh (ZShh) and Drosophila 
hh (hh) amino acid sequences. Figure 9A is an alignment of zebrafish Shh and Drosophila hh 

10 amino acid sequences. Identical amino acids are linked by vertical bars. Dots indicate g^s 
introduced for optimal alignment. Putative transmemfarane/signal peptide sequences are 
underlined (Kyte and Doolittle (1982) J Mol Biol 157:133-148). The position of exon 
boundaries in die Drosophila gene are indicated by arrowheads. The region of highest 
similarity between Z-Shh and hh overlaps exon 2. Figure 9B is a schematic comparison of Z- 

15 Shh and Drosophila hh. Black boxes indicate the position of the putative 
' transmraibrane/signal peptide sequences, relative to the amino-tmnintis. Sequence 
homologies were scored by takmg mto account the alignment of chemically similar amino 
acids and percentage of homology in the boxed regions is indicated. 

Figure 10 is an alignment of partial predicted amino acid sequences firom three 
20 different zebrafish hh homologs. One of these sequences corresponds to Shh, vMLq the o&er 
two define additional hh homologs in zebrafish, named AA(a) and hh(Jb), Amino acid 
identities among the three partial homologs are indicated by vertical bars^ 

Figure 11 is a schematic representations of chick and mouse Shh proteins. The 
putative signal peptides and Asn-linked glycosylation sites are shown. The numbers refer to 
25 amino acid positions. 

Figure 12 is a schematic representation of myc-tagged Shh constructs. The positions 
of tiie c-myc epitope tags are shown, Ieis is the predicted position of the proteolytic cleavage 
site. The shaded area following the signal peptide of the carboxy terminal tagged construct 
represents the region included in the Olutatfaione-S-transferase fusion protein used to generate 
30 antisera in rabbits. 

Figure 13 is a schematic diagram of Shh processing. Illustrated are cleavage of the 
signal peptide (black box), glycosylation at the predicted Asn residue (N), and the secondary 
proteolytic cleavage. The question marks indicate that tiie precise site of proteolytic cleavage 
has not been determined. The different symbols representing tiie carbohydrate moie^ 
35 indicated maturation of this structure in the Golgi apparatus. The dashed arrow leading from 
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the signal peptide cleaved protein indicates that secretion of this species may be an artifact of 
the incomplete proteolytic processing of Shh seen mXenopus oocytes and cos cells. 

Figure 14 is a schematic diagram of a model for the coordinated growth and 
5 patterning of the limb. Sonic is proposed to signal directly to the mesoderm to induce 
expression of the Hoxd and Bmp-2 genes. The induction of these mesodermal genes requires 
competence signals fiom the overlying AER. One such signal is apparently Fgf'4. 
£3q»ession of Fgf-4 in the AER caii be induced by Sonic providing an indirect signaling 
pathway from SorUc to the mesoderm. FGFs also maintain expression of Sonic in the ZPA, 
10 thereby completing a positive feedback loop which controls the relative positions of the 
signaling centers. While Fgf-4 provides competence signals to the mesodenn, it also 
promotes mesodermal proliferation. Thus patterning of the mesodenn is dependent on the 
same signals which promote its proliferation. This mechanism inextricably integrates limb 
patterning with outgiowdi. 

15 

Figure IS is a schematic diagram of patterning of the DrosophOa and vertebrate gut 
Regulatory interactions responsible for patterning of Drosophila midgut (A) are compared to 
a model for patterning of the vertebrate hindgut (B) based on ejq»ession data. Morphologic 
regional distinctions are indicated to the left (A and B), genes expressed in the visceral 

20 mesoderm are in the center panel, those in the gut lumenal endoderm are on the right 
HOM/Hox gene expression domains are boxed. Regionally expressing secreted gene 
products are mdicated by lines. Arrows indicate activating interactions, barred lines, 
inhibitmg interactions. Regulatory interactions in Drosophila gut (A) have been established 
by genetic studies except for the relationship between dpp and hedgehog, vrfiich is 

25 hypothesized based on their interactions in the Drosophila imagmal discs, hedgehog appears 
to be a signal from the endoderm to the mesoderm, and that t^p is expressed in the 
mesodenn. 

Figure 16 is a schematic diagram of chromosomal locations of Ihh, Shh and Dhh in 
30 the mouse genome. The loci were mapped by mterspecific backcross analysis. The 
segregation patterns of the loci and flanking genes in backcross animals that were typed for 
all loci are ^own above the chromosome maps. For individual pairs of loci more animals 
were typed. Each column represents the chromosome idoitified in the backcross progeny 
that was inherited from the (C57BL/6J x M spretus) Fl parent The shaded boxes represent 
35 the presence of a C57BL/6J allele and white boxes represent the presence of a A/, spretus 
allele. The number of the oi&prings inheriting each type of chromosome is listed at the 
bottom of each column. Partial chromosome linkage maps showing location of Ihh, Shh and 
Dhh in relation too linked genes is shown. The numbo- of recombinant N2 animals is 
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presented over total number of N2 animals typed to the left of the chromosome maps between 
each pair of loci. The recombinant frequencies, expressed as genetic distance in 
centimorgans (± one standard error) are also shown. When no recombination between loci 
was detected, the upper 95% confidence limit of the recombination distance is indicated in 
parentheses. Gene order was determined by minimizing the number of recombinant events 
required to explain the allele distribution patterns. The position of loci in human 
chromosomes can be obtained from GDB (Genome Data Base), a computerized database of 
human linkage information maintained by the William H. Welch Medical Library of the John 
Hopkins University (Baltimore, MD). 

Detailed Description of the Invention 

Embiyonic inductive signals are key regulatory proteins that function in vertebzate 
pattern fomiation, and are present in important signaling centers known to operate 
embryonically to define the organization of the vertebrate embryo. For example, these 
signaling structures include the notochord, a transient structure which initiates the fdmiation 
of the nervous system and helps to define the different types of neurons within it. The 
notochord also regulates mesoderaial patterning along the body axis. Another distmct group 
of cells having apparent signaling activity is the floorplate of the neural tube (the precursor of 
the spinal cord and brain) which also signals the differentiation of diflferent nerve cell types. 
It is also generally believed that the region of mesoderm at the bottom of the buds which 
form the limbs (called the Zone of Polarizing Activity or ZPA) operates as a signaling center 
by secreting a moiphogen which ultimately produces the correct pattemmg of the developing 
limbs. 

The present invention concerns the discovery that proteins encoded by a &mily of 
vertebrate genes, termed here hedgehog-relsted genes, comprise the signals produced by 
these embryonic patterning centers. As described herein, each of tiie disclosed vertebrate 
hedgehog (hh) homologs exhibits spatially and temporally restricted expression domains 
indicative of important roles in embryonic patterning. For instance, the results provided 
below indicate that vertebrate hh genes are expressed in the posterior limb bud, Hensen*s 
node, the early notochord, the floor plate of the neural tube, the fore- and hindgut and their 
derivatives. These are all important signaling centers known to be required for proper 
patterning of surrounding embiyonic tissues. 

The Hedgehog family of vertebrate inter-cellular signaling molecules provided by the 
present invention consists of at least four members. Three of these members, herein referred 
to as Desert hedgehog {Dhh\ Sonic hedgehog {Shh) and Indian hedgehog (Ihh), exist in all 
vertebrates, including fish, birds, and mammals. A fourth member, herem referred to as 
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Moomat hedgehog {Mhh), appears specific to fish. According to the appended sequence 
listing, (see also Table 1) a chicken Shh polypeptide is encoded by SEQ ID No:l; a mouse 
Dhh polypeptide is encoded by SEQ ID No;2; a mouse Ihh polypeptide is encoded by SEQ 
ID No:3; a mouse Shh polypeptide is encoded by SEQ ID No:4 a zebrafish Shh polypeptide is 
5 encoded by SEQ ID No:5; a human Shh polypeptide is encoded by SEQ ID No:6; and a 
human IMi polypeptide is encoded by SEQ ID No:7. 

Table 1 

Guide to vertebrate /let/fe/tof sequences 

10 



Chicken 5/1/2 
Mouse DM 
Mouse 
Mouse Shh 
Zebrafish Shh 
Human 57z/i 
Human j?i/t 

Certain of the vertebrate /ferf^etog proteins (AA) of the present invention are define 
by SEQ ID Nos:8-14 and can be cloned fi-om vertebrate organisms including fish, avian and 
mammalian sources. These protems are distinct from the Drosophila protein referred to in 
the literature as a hedgehog protein which, for clarity, will be referred to hereinafter as "Dros- 
15 HH". In addition to the sequence variation between the various hh homologs, the vertebrate 
hedgehog proteins are apparently present naturally in a number of diflfeient fonns, including a 
pro-form, a fiill-length mature form, and several processed fiagments thereof The pro-form 
includes an N-temiinal signal peptide for directed secretion of the extracellular domain, while 
the fiill-length mature form lacks this signal sequence. Further processing of the mature form 
20 apparently occurs in some instances to yield biologically active fiagments of Ae protein. For 
instance, sonic hedgehog undergoes additional proteolytic processing to yield two peptides of 
q)proximately 19 kDa and 27 kDa, both of which aie secreted. In addition to proteolytic 
fiagmentation, the vertebrate hedgehog proteins can also be modified post-ttanslationally, 
such as by glycosylation, though bacterially produced (e.g. unglycosylated) forms of the 
, 25 proteins ^parently still maintain at least some of the activity of the native protein. 

As described in the following examples, the cDNA clones provided by the present 
invention were first obtained by screenmg a mouse genomic library with a partial Drosophila 
hh cDNA clone (.7kb). Positive plaques were identified and one mouse clone was selected. 
This clone was then used as a probe to obtain a genomic clone containing the full coding 
30 sequence of the Mouse Dhh gene. As described in the attached Examples, Northern blots and 
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SEQ ID No. 1 
SEQ ID No. 2 
SEQ ID No. 3 
SEQ ID No. 4 
SEQ ID No. 5 
SEQ ID No. 6 
SEQ ID No. 7 



Amino Acid 



SEQ ID No. 8 
SEQ ID No. 9 
SEQ ID No. 10 
SEQ ID No. II 
SEQ ID No. 12 
SEQ ID No. 13 
SEQ ID No. 14 
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in situ hybridization demonstrated that Mouse Dhh is expressed in the testes, and potentially 
the ovaries, and is also associated with sensory neurons of the head and trunk. Dhh is clearly 
a secreted factor expressed by Sertoli cells in the male testes, which is required for 
maintenance of the male germ line as probably a mitotic and survival factor. Dhh mutants 
- 5 are male sterile. Furthermore, Dhh is expressed as one of the first signs of differentiation of 

the gonad, thus Dhh may be a target of the sex determining gene, Sry. Interestingly, no ' 
expression was detected on the nerve cell bodies themselves (only the axons), indicating that 
D/i^ is likely produced by the Shwann cells. 

In order to obtain cDNA clones encoding chicken hh genes, degenerate 
10 oligonucleotides were designed corresponding to the amino and carboxy ends of Drosophila 
hh exon 2. As described in tiie Examples below, these oligonucleotides were used to isolate 
PCR fi:agments fit>m chicken genomic DNA. These fiagments wm then cloned and 
sequenced. Ten clones yielded two different hh homologs, chicken Dhh and chicken Shk, 
The chicken Shk clone was then used to screen a stage 21/22 limb bud cDNA libraiy which 
1 S yielded a full length SMi clone. 

In order to identify other vertebrate hedgehog homologs, the chicken clones (Dhh and 
Shh) were used to probe a genomic southern blot containing chicken DNA. As described 
below, genomic DNA was cut with various enzymes vAdch do not cleave within the probe 
sequences. The DNA was run on a gel and transferred to a nylon filter. Probes were derived 

20 by ligating each 220 bp clone into a concatomer and then labeling with a random primer kit 
The blots were hybridized and washed at low stringency. In each case, three hybridizing 
bands were observed following autoradiogrcq)hy, one of vAAch was significantly more intense 
(a different band with each probe), indicating ftat there are at least three vertebrate hh genes. 
Additional cDNA and genomic soreens carried out have yielded clones of three hh homologs 

25 firom chickens and mice (Shh^ Dhh and Ihh\ and four hh homologs from zefarafish (Shh, Dhh, 
Ihh and MhK). Weaker hybridization signals suggested that the gene family may be even 
larger. Moreover, a number of weakly hybridiang genomic clones have been isolated. 
Subsequently, the same probes derived fix>m chicken hedgehog homologs have been utilized 
to screen a human genomic library. PCR fiagments derived firom the human genomic libraiy 

30 were then sequenced, and PCR probes derived firom the human sequences were used to screen 
human fetal cDNA libraries. Full-length cDNA encoding human sonic hedgehog protein 
(Shh) and partial cDNA encoding human Indian hedgehog protein (Ihh) were isolated from 
the fetal libraiy, and represent a source of recombinant human hedgehog proteins. 

To order to deteimine the expression patterns of fiie various vertebrate hh homologs, 
35 in situ hybridizations were performed in developing embryos of chicken, mice and fish. As 
described in the Examples below, the resulting expression patterns of each hh homolog were 
similar across each species and revealed that hh genes are expressed in a number of important 
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embryonic signaling centers. For example, Shh is expressed in Hensen*s node, the notochord, 
the ventral floorplate of the developing neural tube, and the ZPA at the base of the limb buds. 
Shh is also expressed in diffemtiated motor neiirons in the embryonic motise (at 11 .5 days 
post fertilization), therefore, Shh may play a role in later stages of motor neuron 
5 development, perhaps in proliferation, but more likely in survival of this cell population. Ihh 
is expressed in the embryonic yolksac and hindgut, and appear also to be involved in 
chondrogenesis; Dhh is expressed in the testes; and Aihh (only in zebrafish) is expressed in 
the notochord and in certain cranial nerves. 

Furthermore, experimental evidence indicates that certain hedgehog proteins initiate 
10 e^qpression of secondary signaling molecules, including Bmp'2 (a TGF-p relative) in the 
mesoderm and Fgf^ in the ectoderm. The mesoderm requires ectodermally-derived 
competence factor(s), which include Fgf4, to activate tai]get gene expression in response to 
hedgehog signaling. The e)q>ression of, for example, Sonic and Fgf-4 is cooidinately 
regulated by a positive feedback loop operating between the posterior mesoderm and the 
1 5 overlying AER, which is the ridge of pseudostratified epithelium extending antero-posteriorly 
along the distal margm of the bud. These data provide a basis for understanding the 
integration of growth and patterning in fhe developing limb vdiich can have important 
implications in the treatmrat of bone disorders described in greater detail herein. 

To determine the role hedgehog proteins plays in inductive interactions between the 
20 endodenn and mesoderm, which are critical to gut morphogenesis, in situ hybridizations and 
recombinant retroviral injections were performed in developing chick embryos. The ventral 
mesoderm is induced to undergo gut-specific differentiation by the adjacent endoderm. As 
described in Examples below, at the earliest stages of chick gut formation Shh is expressed by 
the endoderm, and BMP'4 (a TGF-P relative) is expressed in the adjacent visceral mesoderm. 
25 Ectopic expression of Sonic is sufficient to induce «q)ression of BMP'4 in visceral 
mesoderm, suggesting that Sonic serves as an inductive signal from the endoderm to the 
mesodemi. Subsequent organ-specific endodermal differentiation dq>ends on regional 
inductive signal from the visceral mesoderm. Hox genes are expressed in the undifferentiated 
chick hind gut mesoderm with boundaries corresponding to morphologic borders, suggesting 
30 a role in regulating gut morphogenesis. 

Accordingly, certain aspects of the present invmtion relate to nucleic acids encoding 
vertebrate hedgehog proteins, the hedgehog protems themselves, antibodies immimoreactive 
with hh proteins, and preparations of such compositions. Moreover, the present invention 
provides diagnostic and ther^utic assays and reagents for detecting and treating disorders 
35 involving, for example, aberrant expression of vertebrate hedgehog homologs. In addition, 
drug discovery assays are provided for identifying agents which can modulate the binding of 
vertebrate hedgehog homologues to hedgehog-hmiing moieties (such as hedgehog receptors. 
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ligands, or other e?ctracellular matrix components). Such agents can be useful therapeutically 
to alter the growth and/or differentiation of a cell. Other aspects of the invention are 
described below or will be apparent to those skilled in the art in light of the present 
disclosure. 

5 For convenience, certain terms employed in the specification, examples, and 

appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where, appropriate, ribonucleic acid (RNA). The temi 
should also be understood to mclude, as equivalents, analogs of either RNA or DNA made 
10 fix)m nucleotide analogs, and, as gq^plicable to the embodiment being described, single (sense 
or antisense) and double-sftranded polynucleotides. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
comprising an open reading frame encoding one of the vertebrate hh polypeptides of the 
present invention, including both exon and (optionally) intron sequences. A "recombinant 

15 gene" refers to nucleic acid encodmg a vertebrate hh polypeptide and comprismg vertebrate 
/{^-encoding exon sequences, though it may optionally include intron sequences which are 
either derived from a chromosomal vertebrate hh gene or from an unrelated chromosomal 
gene. Exemplary recombinant genes encoding the subject vertebrate JiA polypeptide are 
represented by SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, 

20 SEQ ID No:6 or SEQ ID No:7. The term "intron" refers to a DNA sequence present in a 
given vertebrate hh gene which is not translated into protein and is gently found between 
exons. 

As used herein, the term "transfection" means the mtroduction of a nucleic acid, 
e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 
25 "Transformation", as used herein, refers to a process in vMch a cell's genotype is changed as 
a result of the cellular uptake of exogenous DNA or RNA, and, for example, the transformed 
cell expresses a recombinant form of a vertebrate hh polypeptide or, where anti-sense 
expression occurs from the transfeixed gene, the rapression of a naturally-occurring form of 
the vertebrate hh protein is disriQ^ted. 

30 As used herein, the term "vector" refers to a nucleic acid molecule capable of 

transporting another nucleic acid to which it has been linked. One type of preferred vector is 
an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors 
are those capable of autonomous replication and/expression of nucleic acids to which they are 
linked. Vectors c^ble of directing the expression of genes to \^ch they are operatively 

35 linked are refeired to herein as "expression vectors". In general, expression vectors of utility 
in recombinant DNA techniques are often in the form of "plasmids" which refer generally to 
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circular double stranded DNA loops which, in their vector fonn are not bound to the 
chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as 
the plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors which serve equivalent functions and which 
5 become known in the art subsequently hereto. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
Mdiich induce or control transcription of protein coding sequences witii which they are 
operably linked. In preferred embodiments, transcription of one of the recombinant 

10 vertebrate hedgehog genes is under the control of a promoter sequence (or other 
transcriptional regulatory sequence) which controls the expression of the recombinant gene in 
a cell-type in which e^qiression is mtended. It will also be understood that the recombiiumt 
gene can be under the control of transoiptional regulatory sequences which are the same or 
which are diffarent firom those sequences vAdch control transcription of the naturally- 

1 5 occurring forms of hedgehog proteins. 

As used herein, the term "tissue-specific promoter" means a DNA sequence that 
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue, such as cells of neural origin, e.g. neuronal cells. The term also 
20 covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily 
in one tissue, but cause expression in other tissues as well. 

As used herein, a "transgenic animal" is any anunal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by way of human intervention, such as by transgenic 

25 techniques well known m the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 
manipulation, such as by microinjection or by infection with a recombinant virus. The term 
genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
ratixer is directed to the introduction of a recombinant DNA molecule. This molecule may be 

30 integrated within a chromosome, or it may be extrachromosomally replicating DNA. In the 
typical transgenic animals described herein, the transgene causes cells to express a 
recombinant form of one of the vertebrate hh proteins, e.g. either agonistic or antagonistic 
forms. However, transgenic animals in which the recombinant vertebrate Mi gene is silent 
are also contemplated, as for exanq)le, the FLP or CRE recombinase dependent constructs 

35 described below. The "non-human animals" of tiie invention include vertebrates such as 
rodents, non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc. Preferred 
non-human animals are selected JBx)m the rodent femily mcluding rat and mouse, most 
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preferably mouse, though transgenic amphibians, such as members of iheXenopus genus, and 
transgenic chickens can also provide important tools for understanding and identifying agents 
which can affect, for example, embryogenesis and tissue formation* The term "chimeric 
animal" is used herein to refer to animals in which the recombinant gene is found, or in which 
5 the recombinant is expressed in some but not all cells of the animal. The tenn "tissue- 
specific chimeric animal" indicates that one of the recombinant vertebrate hh genes is present 
and/or expressed in some tissues but not others. 

As used herein, the term "transgene*" means a nucleic acid sequence (encoding, e.g., 
one of the vertebrate hh polypeptides), which is partly or entirely heterologous, i.e., foreign, 

10 to the transgenic animal or cell into which it is introduced, or, is homologous to an 
endogenous gene of the transgenic animal or cell into i;^ch it is introduced, but which is 
designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the 
genome of the cell into which it is inserted (e.g., it is inserted at a location \Adch differs from 
that of the natural gene or its insertion results in a knockout). A transgene can include one or 

15 more transcriptional regulatory sequences and any other nucleic acid, such as introns, tiiat 
niay be necessaiy for optinial expression of a selected nucte^^ 

As is well known, genes for a particular polypeptide may exist in single or multiple 
copies within the genome of an individual. Such duplicate genes may be identical or may 
have certain modifications, including nucleotide substitutions, additions or deletions, v^ch 

20 all still code for polypeptides having substantially the same activity. The tenn **DNA 
sequence encoding a vertebrate hh polypeptide'* may thus refer to one or more genes within a 
particular individual. Moreover, certain differences in nucleotide sequences may exist 
between individual organisms, which are called alleles. Such allelic dififerences may or may 
not result in differences in amino acid sequence of the encoded polypeptide yet still mcode a 

25 protein with the same biological activity. 

"Homology" refers to sequence similarity between two peptides or between two 
nucleic acid molecules. Homology .can be determined by comparing a position in each 
sequence ^^ch may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
30 homologous at that position. A degree of homology between sequences is a function of die 
number of matching or homologous positions shared by the sequences. An "unrelated" or 
"non-homologous" sequence shares less than 40 percent identity, though preferably less than 
25 percent identity, with one of the vertebrate hh sequences of the present invention. 

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
35 herein. It is understood that such terms refer not only to the particular subject cell but to the 
progeny or potential progeny of such a cell. Because certain modifications may occur in 
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succeeding generations due to either mutation or environmental influences, such progeny 
may not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
5 encoding one of the subject vertebrate hh polypeptides with a second amino acid sequence 
defining a domain foreign to and not substantially homologous with any domain of one of the 
vertebrate hh proteins. A chimeric protein may present a foreign domain which is found 
(albeit in a different protein) in an organism vAdch also expresses the first protem, or it may 
be an "interspecies", "mtergenic", etc. fusion of protein structures e5q)ressed by different 
1 0 kmds of organisms. In general, a fusion protein can be represented by the general formula X- 
AA-Y, wherein hh represents a portion of the protein which is derived from one of the 
vqtehrate hh proteins, and X and Y are independently absent or represent amino acid 
sequences which are not related to one of the vertebrate hh sequences in an organism, 
including naturally occurring mutants. 

15 As used herein, the terms "transforming growth factor-beta" and "TGF-p" denote a 

&mily of structurally related paracrine polypeptides foxmd ubiquitously in vertebrates, and 
prototypic of a large femily of metazoan growth, differentiation, and morphogenesis factors 
(see, for review, Massaque et al. (1990) Am Rev Cell Biol 6:597-641 ; and Spom et al. (1 992) 
J Cell Biol 119:1017-1021). Included m this femily are the "bone moiphogenetic proteins" 

20 or "BMPs", which refers to proteins isolated from bone, and fragments thereof and synthetic 
peptides v^iiich are capable of inducing bone deposition alone or when combined with 
appropriate cofactors. Preparation of BMPs, such as BMP-1, -2, -3, and -4, is described in, 
for example, PCT publication WO 88/00205. Wozney (1989) Growth Fact Res 1:267-280 
describes additional BMP proteins closely related to BMP-2, and which have been designated 

25 BMP.5, -6, and -7. PCT publications WO89/09787 and WO89/09788 describe a protein 
called "OP-1," now known to be BMP-7. Other BMPs are known in the art. 

The term "isolated" as also used herein with respect to nucleic acids, such as DNA or 
RNA, refCTS to molecules separated fix)m other DNAs, or RNAs, respectively, that are present 
m the natural source of the macromolecule. For example, an isolated nucleic acid encoding 

30 one of the subject vertebrate hh polypeptides preferably includes no more than 10 kilobases 
(kb) of nucleic acid sequence which naturally immediately flanks the vertebrate hh gene in 
genomic DNA, more preferably no more than 5kb of such naturally occurring flanking 
sequences, and most preferably less than 1.5kb of such naturally occurring flanking sequence. 
The term isolated as used herein also refers to a nucleic acid or peptide that is substantially 

35 free of cellular material, viral material, or culture medium v^en produced by recombinant 
DNA techniqxies, or chemical precursors or other chemicals when chemically synthesized. 
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Moreover, an "isolated nucleic acid*' is meant to include nucleic acid fragments which are not 
naturally occurring as fragments and would not be found in the natural state. 

As described below, one aspect of the invention pertains to isolated nucleic acids 
comprising the nucleotide sequences encoding vertebrate /lA homologues, and/or equivalents 
of such nucleic acids. The term nucleic acid as used hmin is intended to include fragments 
as equivalents. The term equivalmt is understood to include nucleotide sequences encoding 
functionally equivalent hh polypeptides or functionally equivalent peptides having an activity 
of a vertebrate hh protein such as described herein. Equivalent nucleotide sequences will 
include sequences that differ by one or more nucleotide substitutions, additions or deletions, 
such as allelic variants; and will, therefore, include sequences that differ from the nucleotide 
sequence of the vertebrate hh cDNAs shown in SEQ ID Nos:l-7 due to the degeneracy of the 
genetic code. Equivalents will also include nucleotide sequences that hybridize under 
stringent conditions (i.e., equivalent to about 20-27*^C below tiie melting temperature (Tjq) of 
the DNA duplex formed in about IM salt) to the nucleotide sequences represented in SEQ ID 
Nos:l-7. In one embodiment, equivalents will further include nucleic acid sequences derived 
from and evolutionarily related to, a nucleotide sequences shown in any of SEQ ID Nos: 1-7. 

Moreover, it will be generally appreciated that, und^ certain circumstances, it may be 
advantageous to provide homologs of one of the subject hedgehog polypeptides which 
function in a limited c^)acity as one of either an hh agonist or an /rA antagonist, in order to 
20 promote or inhibit only a subset of the biological activities of the naturallyoccuning form of 
the protein. Thus, specific biological effects can be elicited by treatment witii a homolog of 
limited function, and with fewer side effects relative to treatment with agonists or antagonists 
which are directed to all of the biological activities of naturally occurring forms of hedgehog 
proteins. 

25 Homologs of one of the subject hedgehog proteins can be generated by mutagenesis, 

such as by discrete point mutation(s), or by truncation. For instance, mutation can give rise 
to homologs which retain substantially the same, or merely a subset, of the biological activity 
of the hh polypeptide from which it was derived. Alternatively, antagonistic forms of the 
protem can be generated wiiich are able to inhibit the function of the naturally occurring form 

30 of the protem, such as by competitively binding to an M receptor. 

Polypeptides referred to herein as having an activity of a vertebrate hh protein are 
defined as peptides that have an amino acid sequence corresponding to all or a portion of the 
amino acid sequences of a vertebrate hh protems shown in any of SEQ ID No:8, SEQ ID 
No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13 or SEQ ID No:14 and 
35 which have at least one biological activity of a vertebrate hh protein. Examples of such 
biological activity of a vertebrate hh protein mclude the ability to induce (or otherwise 
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modulate) fonnation and differentiation of the head, limbs» lungs, central nervous system 
(CNS), or mesodermal patterning of developing vertebrate embryos. In preferred 
embodiments, the biological activity can comprise an ability to regulate neurogenesis, such 
as a motor neuron inducmg activity, a neuronal differentiation inducing activity, or a 
5 neuronal survival promoting activity. Hedgehog proteins of the present invention can also 
have biological activities which include an ability to regulate organogensis, such as through 
the ability to influence limb patterning, by, for example, skeletogenic activity. The biological 
activity associated with the hedgehog proteins of the present invention can also include the 
ability to induce stem cell or germ cell differentiation, including the ability to induce 

10 differentiation of chondrocytes or an involvement in spermatogenesis. Hedgehog proteins of 
the present invention can also be characterized in terms of biological activities which include: 
an ability to modulate proliferation, survival and/or differentiation of mesodermally-derived 
tissue, such as tissue derived from dorsal mesoderm; the ability to modulate proliferation, 
survival and/or differaitiation of ectodermally-derived tissue, such as tissue derived from the 

15 neural tube, neural crest, or head mesenchyme; the ability to modulate proliferation, survival 
and/or differentiation of endodermally-derived tissue, such as tissue derived from the 
primitive gut. Moreover, as described in the Examples below, the subject hedgehog proteins 
have the ability to induce expression of secondary signaling molecules, such as members of 
the Transforming Growth Factor |J (TGFp) family, including bone morphogenic proteins, e.g. 

20 BMP-2 and BMP'4, as well as members of the fibroblast growth factor (FGF) fEonily, such as 
Fgf4. Other biological activities of the subject hedgehog proteins are described herein or 
will be reasonably apparent to those skilled in the art. According to the present invention, a 
polypeptide has biological activity if it is a specific agonist or antagonist of a naturally- 
occurring form of a vertebrate hedgehog protein, 

25 Preferred nucleic acids encode a vertebrate hedgehog polypeptide comprising an 

amino acid sequence at least 60% homologous, more preferably 70% homologous and most 
preferably 80% homologous with an amino acid sequence selected from the group consisting 
of SEQ ID Nos:8-14. Nucleic acids , which encode polypeptides at least about 90%, more 
preferably at least about 95%, and most preferably at least about 98-99% homology vsdth an 

30 amino acid sequence represented in one of SEQ ID Nos:8-14 are also within the scope of the 
invention. In one embodiment, the nucleic acid is a cDNA encoding a peptide having at least 
one activity of the subject vertebrate hh polypeptide. Preferably, the nucleic acid includes all 
or a portion of the nucleotide sequence corresponding to the coding region of SEQ ID Nos:l- 
7. 

35 Another aspect of the invention provides a nucleic acid which hybridizes under high 

or low stringency conditions to a nucleic acid represented by one of SEQ ID Nos:l-7. 
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x 
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sodium chloride/sodium citrate (SSC) at about 45^C, followed by a wash of 2.0 x SSC at 
50°C, are known to those skilled in the art or can be found in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 63.1-6.3.6. For example, the salt concentration in 
the wash step can be selected from a low stringency of about 2.0 x SSC at 50°C to a high 
stringency of about 0.2 x SSC at 50X. In addition, the temperature in the wash step can be 
increased from low stringency conditions at room temperature, about 22**C, to high 
stringency conditions at about 65°C. 

Nucleic acids, havmg a sequence that di£fers from the nucleotide sequences shown in 
one of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, SEQ ID 
No:6 or SEQ ID No:7 due to degeneracy in the genetic code are also within the scope of the 
mvention. Such nucleic acids racode functionally equivalent pq)tides (i.e., a peptide having 
a biological activity of a vertebrate hh polypeptide) but differ in sequence from the sequence 
shown in the sequence listing due to degeneracy m the genetic code. For example, a number 
of amino acids are designated by more than one triplet. Codons that specify the same amino 
acid, or synonyms (for example, CAU and CAC each encode histidine) may result in "silent" 
mutations which do not affect the amino acid sequence of a vertebrate hh polypeptide. 
However, it is expected that DNA sequence polymorphisms that do lead to changes in the 
amino acid sequences of the subject hh polypeptides will exist among vertebrates. One 
skilled in the art will appreciate that these variations in one or more nucleotides (up to about 
3-5% of the nucleotides) of the nucleic acids encoding polypeptides having an activity of a 
vertebrate hh polypeptide may exist among individuals of a given species due to natural 
allelic variation. 

Fragments of the nucleic acids encoding an active portion of the vertebrate hedgehog 
proteins are also witiun the scope of the invention. As used herein, a hedgehog gene 
fragment refers to a nucleic acid having fewer nucleotides than the nucleotide sequence 
encoding the entire amino acid sequence of a vertebrate hh protem represented in SEQ ID 
No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13 or SEQ 
ID No: 14, yet which (preferably) encodes a peptide which retains some biological activity of 
the full length protein, e.g. the fragment retains the ability to induce formation and 
differentiation of the head, limbs, lungs, central nervous system (CNS), or mesodermal 
patterning of developmg vertebrate embryo. Nucleic acid fragments within the scope of the 
present invention include those capable of hybridizing under high or low stringency 
conditions with nucleic acids from other species for use m screenii^ protocols to detect other 
hedgehog homologs, as well as those capable of hybridizing with nucleic acids from human 
specimens for use in detecting the presence of a nucleic acid encoding a hedgehog protem, 
including alternate isoforms, e.g. mRNA splicing variants. Nucleic acids within flie scope of 
the invention may also contain linker sequences, modified restriction endonuclease sites and 
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Other sequences useful for molecular cloning, expression or purification of recombinant 
forms of the subject hh polypeptides. 

As indicated by the examples set out below, hedgehog protein-encoding nucleic acids 
can be obtained torn mRNA present in any of a number of eukaryotic cells. It should also be 
5 possible to obtain nucldc acids encodmg vertebrate hh polypeptides of the present invention 
fix>m genomic DNA obtained fiom boft adults and embryos. For example, a gene encoding a 
hh protein can be cicmed fiom d&er a cDNA or a genomic library in accordance with 

protocob described herein, as weU as Aose genoally known to persons skilled in the art A 
cDNA encoding a hedgehog protem can be obtained by isolating total mRNA fiom a cell, 

10 e.g. a mammalian cell, e.g. a human cell, including embryonic cells. Double stranded 
cDNAs can then be prqjaied from tiie total mRNA, and subsequoitly inserted into a suitable 
plasmid or bacteriophage vector using any one of a number of known techniques. The gene 
encoding a vertebrate hh protein can also be cloned using established polymerase duun 
reaction techniques in accordance vdHh the nucleotide sequence information provided by the 

15 invaition. The nucleic acid of the mvention can be DNA or RNA. A preferred nucleic acid 
is a cDNA represented by a sequence selected from the group conasting of SEQ ID Nos: 1-7. 

Another aspect of the invoition relates to the use of the isolated nucleic acid in 
"antisense" fheapy. As used herran, "antisense" therapy refers to administration or in situ 
genoation of oligonucleotide probes or tlwu- derivatives which specifically hybridizes (e.g. 

20 binds) under cellular conditions, with die cellular mRNA and/or genomic DNA encoding one 
or more of the subject hedgehog protems so as to inhibit expression of that protein, e.g. by 
inhibiting transcription and/or translation. The landing may be by conventional base pan- 
complementarity, or, for example, in the case of binding to DNA duplexes, through specific 
interactions in the major groove of tiie double helix. In gmeral, "antisense" fheacpy refers to 

25 the range of techniques generally employed in the art, and includes any therapy vMch relies 
on ^)ecific Unding to oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
e3q>ression plasmid vAdch, vrhcn transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of flie cellular mRNA which encodes a vertebrate 

30 hh protein. Alternatively, the antisense construct is an oligonucleotide probe which is 
generated ex vivo and vMch, when introduced into the cell causes inhibition of expression by 
hybridiang with the mRNA and/or genomic sequences of a vertebrate hh gme. Such 
oligonucleotide probes are preferably modified oligonucleotide v^iich are resistant to 
mdogenous nucleases, e.g. exoniuleases and/or mdonucleases, and is therefore stable in 

35 vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. 
Patents 5,176,996; 5,264,564; and 5,256,775). AdditionaUy, general q>proaches to 
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constructing oligomers usefiil in antisense therapy have been reviewed, for example, by Van 
der Krol et al. (1988) Biotechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 
2668. 

Accordingly, the modified oligomers of the invention are usefiil in therapeutic, 
5 diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized in a 
manner appropriate for antisense therapy in general. For such therapy, the oligom^ of the 
invention can be formulated for a variety of loads of administration, including systemic and 
topical or localized administration. Techniques and formulations generally may be found in 
Remmington's Pharmaceutical Sciences. Meade Publishing Co., Easton, PA. For systemic 
10 administration, injection is preferred, including intramuscular, intravenous, intraperitoneal, 
and subcutaneous for injection, the oligomers of &e invention can be formulated in liquid 
solutions, preferably in physiologically compatible biiffers such as Hank's solution or 
Ringer's solution. In addition, the oligomers may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophilized forms are also included. 

15 Systemic administration can also be by transmucosal or transdermal means, or the 

compounds can be administered orally. For transmucosal or transdermal administration, 
penetrants appropriate to the barrier to be permeated are used in the formulation. Such 
penetrants are generally knoivn in the art, and include, for example, for transmucosal 
administration bile salts and iusidic acid derivatives. In addition, detergents may be used to 

20 facilitate permeation. Transmucosal administration may be through nasal sprays or using 
suppositories. For oral administration, the oligomers are formulated into conventional oral 
administration forms sudi as cs^sules, tablets, and tonics. For topical administratioii, the 
oligomers of the invention are formulated into ointments, salves, gels, or creams as generally 
known in the art 

25 In addition to use in therapy, the oligomers of the invention may be used as diagnostic 

reagents to detect the presence or absence of the target DNA or RNA sequences to which 
they specifically bind. Such diagnostic tests are described in further detail below. 

Likewise, the antisense constructs of the preset invention, by antagonizing the 
normal biological activity of one of the hedgehog proteins, can be used in the manipulation 
30 of tissue, e.g. tissue differentiation, both in vivo and in ex vivo tissue cultures. 

Also, the anti-sense techniques (e.g. microinjectioh of antisense molecules, or 
transfection with plasmids whose transcripts are anti«-sense with regard to an Mi mRNA or 
gene sequence) can be used to investigate role of hh in developmental events, as well as the 
normal cellular function of hh in adult tissue. Such techniques can be utilized in cell culture, 
35 but can also be used in the creation of transgenic animals. 
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This invention also provides expression vectors containing a nucleic acid encoding a 
vertebrate hh polypeptide, operably linked to at least one transcriptional regulatory sequence. 
Operably linked is intended to mean that the nucleotide sequence is linked to a regulatory 
sequence in a manner which allows expression of the nucleotide sequence. Regulatory 
5 sequences are art-recognized and are selected to direct expression of the subject vertebrate hh 
protems. Accordingly, the term transcriptional regulatory sequence includes promoters, 
enhancers and other expression control elements. Such regulatory sequences are described in 
Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). For instance, any of a wide variety of ejcpression control sequences, 

10 sequences that control the expression of a DNA sequence when operatively linked to it, may 
be used in these vectors to express DNA sequences encoding vertebrate hh polypeptides of 
this invention. Such useful expression control sequences, include, for example, a viral LTR, 
such as the LTR of the Moloney murine leukemia virus, the early and late promoters of 
SV40, adenovirus or cjrtomegalovirus immediate early promoter, the lac system, the tip 

15 system, the TAC or TRC system, T7 promoter whose expression is directed by T7 RNA 
polymerase, the major operator and promoter regions of phage X , the control regions for fd 
coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the 
promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast a-mating factors, the 
polyhedron promoter of the baculovirus system and other sequences known to control the 

20 expression of genes of prokaryotic or eukaiyotic cells or tiieir viruses, and various 
combinations thereof It should be understood that the design of the expression vector may 
depend on such fectors as the choice of the host cell to be transformed and/or the type of 
protein desired to be expressed. Moreover, the vector's copy number, the ability to control 
that copy number and the expression of any other proteins encoded by the vector, such as 

25 antibiotic markers, should also be considered. In one embodiment, the expression vector 
includes a recombinant gene encoding a peptide having an agonistic activity of a subject 
hedgehog polypeptide, or alternatively, encoding a peptide which is an antagonistic form of 
the hh protein. Such expression vectors can be used to transfect cells and thereby produce 
polypeptides, including fusion proteins, encoded by nucleic acids as described herein. 

30 Moreover, the gene constructs of the present invention can also be used as a part of a 

gene therapy protocol to dehver nucleic acids encoding either an agonistic or antagonistic 
form of one of the subject vertebrate hedgehog proteins. Thus, another aspect of ttie 
invention features expression vectors for in vivo or in vitro transfection and expression of a 
vertebrate hh polypeptide in particular cell types so as to reconstitute the function of, or 

35 alternatively, abrogate the function of hedgehog-indnctd signaling m a tissue in which the 
naturally-occurring form of the protein is mise5q)ressed; or to deliver a form of the protein 
which alters differentiation of tissue, or which inhibits neoplastic transformation. 
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E}q3ression constructs of the subject vertebrate hh polypeptide, and mutants thereof, 
may be administered in any biologically effective earner, e.g. any formulation or 
composition capable of effectively delivering the recombinant gene to cells in vivo. 
Approaches include insertion of the subject gene in viral vectors including recombinant 

5 retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus- 1, or recombinant 
bacterial or eukaiyotic plasmids. Viral vectors transfect cells directly; plasmid DNA can be 
delivered with the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g. 
antibody conjugated), polylysine conjugates, gramacidin S, artificial viral envelopes or other 
such intracellular carriers, as well as direct injection of the gene construct or CaP04 

10 precipitation carried out in vivo. It will be appreciated that because transduction of 
appropriate target cells represents the critical first step in gene therapy, choice of the 
particular gene delivery system will depend on such factore as the phenotype of the intended 
target and the route of administration, e.g. locally or systemically. Furthermore, it will be 
recognized that the particular gene construct provided for in vivo transduction of hedgehog 

15 expression are also usefiil for in vitro transduction of cells, such as for use in the ex vivo 
tissue culture systems described below. 

A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a 
viral vector containing nucleic acid, e.g. a cDNA, encoding the particular form of the 
hedgehog polypeptide desired. Infection of cells with a viral vector has the advantage that a 
20 large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules 
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed 
efficientiy in cells which have taken up viral vector nucleic acid. 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombinant gene delivery system of choice for tiie transfer of exogenous genes in vivo, 

25 particularly into humans. These vectors provide efficient deliv^y of genes into cells, and the 
transferred nucleic acids are stably integrated into the chromosomal DNA of the host A 
major prerequisite for the use of retroviruses is to ensure the safety of their use, particularly 
with regard to the possibility of the spread of wild-type virus in the cell population. The 
development of specialized cell lines (termed "packaging cells") which produce only 

30 replication-defective retroviruses has increased the utility of retroviruses for gene ther^y, 
and defective retroviruses are well characterized for use in gene transfer for gene therapy 
purposes (for a review see Miller, A.D. (1990) Blood 76:271). Thus, recombinant retrovirus 
can be constructed in which part of the retroviral coding sequence (gag, pol, env) has been 
replaced by nucleic acid encoding one of the subject proteins rendering the retrovirus 

35 replication defective. The replication defective retrovirus is then packaged into virions which 
can be used to infect a target cell through the use of a helper virus by standard techniques. 
Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo 
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with such viruses can be found in Current Pmtonok in Mf>\^rn]nT ftifflrr Ausubel, F.M. et 
al. (eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard 
laboratory manuals. Examples of suitable retroviruses include pU, pZIP, pWE and pEM 
which are well known to those skilled in the art. Examples of suitable packaging virus lines 
for preparing both ecotropic and amphotropic retroviral systems include \|/Crip, .\|/Cre, v2 
and vj/Am. Retroviruses have been used to introduce a variety of genes into many different 
ceU types, including neuronal cells, in vitro and/or in vivo (see for example Eglitis, et al. 
(1985) Science 230:1395-1398; Danos and MuUigan (1988) Proc. Natl. Acad Scl USA 
85:6460-6464; Wilson et al. (1988) Proc. Natl. Acad. Sci. USA 85:3014-3018; Annentano et 
al. (1990) Proc. Natl. Acad Sci. USA 87:6141-6145; Huber et al. (1991) Proc. Natl. Acad- 
Sci. USA 88:8039-8043; Ferry et al. (1991) Proc. Natl. Acad Sci. USA 88:8377-8381^ 
Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proc. Natl. 
Acad Sci. USA 89:7640-7644; Kay et al. (1992) Human Gene Ther<^ 3:641-647; Dai et al. 
(1992) Proc. Natl. Acad Sci. USA 89:10892-10895; Hwu et al. (1993) J Immunol. 150:4104- 
4115; U.S. Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 
89/07136; PCT Application WO 89/02468; PCT AppUcation WO 89/05345; and PCT 
Application WO 92/07573). 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retroviral-based vectors, by modifying the viral packaging 
proteins on the surface of the viral particle (see, for exan^le PCT pubUcations W093/25234 
and WO94/06920). For instance, strategies for the modification of the infection spectnmi of 
retroviral vectors include: coupling antibodies specific for cell surfece antigens to the viral 
env protein (Roux et al. (1989) PNAS 86:9079-9083; Man et al. (1992) J. Gen Virol 
73:3251-3255; and Goud et al. (1983) Virology 163:251-254); or coupling ceU surface 
receptor ligands to the viral env proteins (Neda et al. (1991) J Biol Chem 266:14143-14146). 
Couplmg can be in the form of the chemical cross-linking with a protein or other variety (e.g. 
lactose to convert the env protein to an asialoglycoprotem), as well as by generating fiision 
proteins (e.g. single-chain antibody/env fiision proteins). This technique, while usefiil to 
limit or otherwise direct the infection to certain tissue types, can also be used to convert an 
ecotropic vector in to an amphotropic vector. 

Moreover, use of retroviral genedelivery can be finther enhanced by the use of tissue- 
or cell-specific transcriptional regulatory sequences which control expression of the hh gene 
of the retroviral vector. 

Another viral gene deUveiy system usefijl in the present invention utilizes adenovirus- 
derived vectors. The genome of an adenovirus can be manipulated such that it encodes and 
ejqwesses a gene product of interest but is inactivated in terms of its ability to repUcate in a 
normal lytic viral life cycle. See for example Berkner et al. (1988) BioTechniques 6:616; 
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Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155. 
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other 
strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are Well known to those skilled in die art 
Recombinant adenoviruses can be advantageous in certain circumstances in that they can be 
5 used to infect a wide variety of cell types, includmg airway epithelium (Rosenfeld et al. 
(1992) cited supra), endothelial cells (Lemarchand et al. (1992) Proc. Natl Acad ScL USA 
89:6482-6486), hepatocytes (Herz and Gerard (1993) Proc. Natl Acad ScL USA 90:2812- 
2816) and muscle cells (Quantin et al. (1992) Proc. Natl. Acad Sci. USA 89:2581-2584). 
Furthermore, the virus particle is relatively stable and amenable to purification and 

10 concentration, and as above, can be modified so as to affect the spectrum of infectivity. 
Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not 
integrated into the genome of a host cell but remains episomal, thereby avoiding potential 
problems that can occur as a result of insertional mutagenesis in situations where introduced 
DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the 

15 carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) 
relative to oAct gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and Graham 
(1986) J. Virol 57:267). Most replication-defective adenoviral vectors currently in use and 
therefore favored by the present invention are deleted for all or parts of the viral El and E3 
genes but retain as much as 80% of the adenoviral genetic material (see, e.g., Jones et al. 

20 (1979) Cell 16:683; Berkner et al., siq>ra; and Graham et al. in Methods in Molecular 
Biology. EJ. Murray, Ed. (Humana, Cliflon, NJ, 1991) vol. 7. pp. 109-127). Expression of 
the inserted hedgehog gene can be under control of, for example, the ElA promoter, tht 
major late promoter (MLP) and associated leader sequences, the E3 promoter, or 
exogenously added promoter sequences. 

25 Yet another viral vector system useful for delivery of one of the subject vertebrate hh 

genes is the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring 
detective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper 
virus for efiBcient replication and a productive life cycle. (For a review see Muzyczka et al. 
Curr. Topics in Micro, and Immunol (1992) 158:97-129). It is also one of the few viruses 

30 that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable 
integration (see for example Flotte et al. (1992) Am. J. Respir. Cell Mol Biol 7:349-356; 
Samulski et al. (1989) J. Virol 63:3822-3828; and McLaughUn et ^. (1989) 1 Virol 
62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged and 
canmtegrate. Space for exogenous DNA is lunited to about 4.5 kb. An AAV vector such as 

35 that described in Tratschm et al. (1985) Mol Cell Biol 5:3251-3260 can be used to mtroduce 
DNA into cells. A variety of nucleic acids have been introduced into different cell types 
using AAV vectors (see for example Hermonat et al. (1984) Proc. Natl Acad ScL USA 
81:6466-6470; Tratschin et al. (1985) Mol Cell Biol 4:2072-2081; Wondisford et al. (1988) 
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Mol Endocrinol 2:32-39; Tratschin et al. (1984) J. Virol 51:611-619; and Flotte et al. 
(1993) J. Biol Chem. 268:3781-3790). 

In addition to viral transfer methods, such as those illustrated above, non-viral 
methods can also be employed to cause expression of a subject hedgehog polypeptide in the 
5 tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used 
by mammalian cells for the uptake and intracellular transport of macromolecules. In 
preferred embodiments, non-viral gene delivery systems of the present invention rely on 
endocytic pathways for the uptake of the subject hh polypeptide gene by the targeted cell. 
Exemplary gene delivery systems of this type include liposomal derived systems, poly-lysine 
1 0 conjugates, and artificial viral envelopes. 

In clinical settings, the gene delivery systems for the therapeutic hedgehog gene can 
be introduced into a patient by any of a number of methods, each of which is familiar in the 
art. For instance, a pharmaceutical preparation of the gene delivery system can be introduced 
systemically, e.g. by intravenous injection, and specific transduction of the protein in the 

15 target cells occurs predominantly fi^om specificity of transfection provided by the gene 
delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory 
sequences controlling expression of the receptor gene, or a combination thereof. In other 
embodiments, initial delivery of the recombinant gene is more limited with introduction into 
the animal being quite localized. For example, the gene delivery vehicle can be introduced 

20 by catheter (see U.S. Patent 5^28,470) or by stereotactic injection (e.g. Chen et al, (1994) 
PNAS 91 : 3054-3057). A vertebrate hh gene, such as any one of the clones represented in the 
group consistmg of SEQ ID NO: 1-7, can be delivered in a gene therapy construct by 
electroporation using techniques described, for example, by Dev et al. ((1994) Cancer Treat 
J?ev 20:105-1 15). 

25 The pharmaceutical preparation of the gene therapy construct can consist essentially 

of the gene delivery system in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery system can be produced intact fi-om recombinant cells, e.g. retroviral vectors, the 
pharmaceutical preparation can comprise one or more cells which produce the gene delivery 

30 system. 

Another aspect of the present invention concerns recombinant forms of the hedgehog 
proteins. Recombinant polypeptides preferred by the present invention, in addition to native 
hedgehog proteins, are at least 60% homologous, more preferably 70% homologous and most 
preferably 80% homologous with an amino acid sequence represented by any of SEQ ID 
35 Nos:8-14. Polypeptides which possess an activity of a hedgehog protein (i.e. either agonistic 
or antagonistic), and which are at least 90%, more preferably at least 95%, and most 
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preferably at least about 98-99% homologous witb a sequence selected from the group 
consisting of SEQ ID Nos:8-14 are also within the scope of the invention. 

The term "recombinant protein" refers to a polypeptide of the present invention which 
is produced by recombinant DNA techniques, wherein generally, DNA encoding a vertebrate 

5 hh polypeptide is inserted into a suitable expression vector which is in turn used to transform 
a host cell to produce the heterologous protein. Moreover, the phrase "derived from", with 
respect to a recombinant hedgehog gene, is meant to include within Ifae meaning of 
"recombinant protein" fliose proteins having an ammo acid sequence of a native hedgehog 
protein, or an amino acid sequence similar thereto which is generated by mutations including 

1 0 substitutions and deletions (including truncation) of a naturally occurring form of the protein. 

The present invention further pertains to recombinant forms of one of the subject 
hedgehog polypeptides which are encoded by genes derived from a vertebrate organism, 
particularly a mammal (e.g- a human), and which have amino acid sequences evolutionarily 
related to the hedgehog proteins represented in SEQ ID Nos:8-14. Such recombmant hh 

15 polypeptides preferably are capable of ftmctioning m one of either role of an agonist or 
antagonist of at least one biological activity of a wild-type ("auflientic") hedgehog protein of 
the appended sequence listing. The term "evolutionarily related to", with respect to amino 
acid sequences of vCTtebrale hedgehog proteins, refers to both polypeptides having amino 
acid sequences which have arisen naturally, and also to mutational variants of vertebrate hh 

20 polypeptides which are derived, for example, by combinatorial mutagenesis. Such 
evolutionarily derived hedgehog protems polypeptides preferred by the present invention are 
at least 60% homologous, more preferably 70% homologous and most preferably 80% 
homologous with the amino acid sequence selected from the group consisting of SEQ ID 
Nos:8-14. Polypeptides havmg at least about 90%, more preferably at least about 95%, and 

25 most preferably at least about 98-99% homology with a sequence selected from the group 
consisting of SEQ ID Nos:8-14 are also within the scope of the invention. 

The present invention further, pertains to methods of producing the subject hedgehog 
polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding the subject polypeptides can be cultured under 

30 appropriate conditions to allow oqrossion of the peptide to occur. The polypeptide 
hedgehog may be secreted and isolated from a mixture of cells and medium containing tiie 
recombmant vwtebrate hh polypeptide. Alternatively, tiie peptide may be retained 
cytoplasmically by removing the signal peptide sequence from the recombinant hh gene and 
the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media 

35 and other byproducts. Suitable media for cell culture are well known in the art. The 
recombinant hh polypq3tide can be isolated from cell culture medium, host cells, or both 
using techniques known in the art for purifying proteins including ion-exchange 
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chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaffinity purification with antibodies specific for such peptide. In a preferred 
embodiment, the recombinant hh polypeptide is a fiision protein containing a domain which 
facilitates its purification, such as an hh/GST fiision protein. 

5 This invention also pertains to a host cell transfected to es^ress a recombinant form of 

the subject hedgehog polypeptides. The host cell may be any prokaiyotic or eukaiyotic cell. 
Thus, a nucleotide sequence derived 6om the cloning of vertebrate hedgehog proteins, 
encoding all or a selected portion of the full-length protein, can be used to produce a 
recombinant form of a vertebrate hh polypeptide via microbial or eukaryotic cellular 

10 processes. Ligatmg the polynucleotide sequence into a gene construct, such as an expression 
vector, and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or 
mammalian) or prokaiyotic (bacterial cells), are standard procedures used in producing other 
well-known proteins, e.g. insulin, interferons, human growth hormone, IL-1, IL-2, and the 
like. Similar procedures, or modifications thereof, can be employed to prepare recombinant 

15 hedgehog polypeptides by microbial means or tissue-culture technology in accozd with the 
subject invention. 

The recombinant hedgehog genes can be produced by ligating nucleic acid encoding 
an Mt protein, or a portion th»eof, mto a vector suitable for expression in either prokaiyotic 
cells, eukaryotic cells, or bofli. Expression vectors for production of recombinant forms of 
20 the subject hh polypeptides include plasmids and other vectors. For instance, suitable vectois 
for the expression of a hedgehog polypeptide include plasmids of the types: pBR322-derived 
plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids and 
pUC-derived plasmids for expression in prokaryotic cells, such as E, coli. 

A mmiber of vectors exist for the expression of recombinant proteins in yeast. For 
25 instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning and expression 
vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, for example, 
Broach et al (1983) in Experimental Manipulation of Gene Expression^ ed. M. Inouye 
Academic Press, p. 83, incorporated by reference herein). These vectors can replicate in 
coli due the presence of the pBR322 ori, and in 5. cerevisiae due to the replication 
30 detenninant of the yeast 2 micron plasmid. In addition, drug resistance markers such as 
ampicillin can be used. In an illustrative embodiment, an hh polypeptide is produced 
recombinantly utilizing an e}q)ression vector generated by sub-cloning the coding sequence of 
one of the hedgehog genes represented in SEQ ID Nos: 1 -7. 

The preferred mammalian expression vectors contain both prokaryotic sequences, to 
35 facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription 
units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, 
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pSV2gpt, pSV2neo, pSV2-dhfi:, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived 
vectors are examples of mammalian expression vectors suitable for transfection of exikaryotic 
cells. Some of these vectors are modified with sequences firom bacterial plasmids, such as 
pBR322, to facilitate replication and drug resistance selection in both prokaryotic and 
5 eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papillomavirus 
(BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient 
expression of proteins in eukaryotic cells. The various methods employed in the preparation 
of the plasmids and transformation of host organisms are well known in the art. For other 
suitable expression systems for both prokaryotic and eukaryotic cells, as well as general 
10 recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed,, ed. by 
Sambrook, Fritsch and Maniads (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 
and 17. 

In some instances, it may be desirable to express the recombinant hedgehog 
polypeptide by the tise of a baculovirus expression system. Examples of such baculovirus 
15 expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and pVL941), 
pAcUW-derived vectors (sudi as pAcUWl), and pBlueBac-derived vectors (such as the B-gal 
containing pBlueBac III). 

When it is desirable to express only a portion of an M protein, such as a form lacking 
a portion of the N-teiminus, i.e. a truncation mutant \;^ch lacks the signal peptide, it may be 

20 necessary to add a start codon (ATG) to the oligonucleotide fi:agment containing the desired 
sequence to be expressed. It is well known in the art that a methionine at tiie N-terminal 
position can be enzymatically cleaved by the use of the eazyme methionine anunopeptidase 
(MAP). MAP has been cloned from E. coli (Ben-Bassat et al. (1987) J. Bacterial 169:751- 
757) and Salmonella typhimurium and its in vitro activity has been demonstrated on 

25 recombmant proteins (Miller et al. (1987) PNAS W:271 8-1722). Therefore, removal of an N- 
tenninal methionine, if desired, can be achieved either in vivo by expressing hedgehog- 
derived polypq>tides m a host which produces MAP (e.g., £. coli or CM89 or S. cerevisiae), 
or in vitro by use of purified MAP (e.g., procedure of Miller et al., supra). 

Alternatively, the coding sequences for the polypeptide can be incorporated as a part 
30 of a fiision gene including a nucleotide sequence encoding a different polypeptide. This type 
of expression system can be usefiil under conditions where it is desirable to produce an 
immunogenic fragment of a hedgehog protein. For example, &e VP6 capsid protein of 
rotavirus can be used as an immunologic carrier protein for portions of the hh polypeptide, 
either in the monomeric form or in the form of a viral particle. The nucleic acid sequences 
35 corresponding to fhe portion of a subject hedgehog protein to which antibodies are to be 
raised can be incorporated into a fiision gene construct which includes coding sequences for a 
late vaccinia virus structural protein to produce a set of recombinant viruses expressing 
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fusion proteins comprising hh epitopes as part of the virion. It has been demonstrated with 
the use of immunogenic fusion proteins utilizing the Hepatitis B surface antigen fusion 
proteins that recombinant Hepatitis B virions can be utilized in this role as well. Sitnilarly, 
chimeric constructs coding for fusion proteins contaming a portion of an hh protein and the 
5 poliovirus capsid protein can be created to enhance immunogenicity of the set of polypeptide 
antigens (see, for example, EP Publication No: 0259149; and Evans et al. (1989) l^atwe 
339:385; Huang et al. (1988) J. Virol 62:3855; and Schlienger et al. (1992) 1 Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can also be 
utilized to generate an immunogen, wherein a desired portion of an hh polypeptide is obtained 
10 directly from organo-chemical synthesis of the peptide onto an oligomeric branching lysine 
core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli et al. (1992) 
7. Immunol 148:914). Antigenic determinants of hh proteins can also be expressed and 
presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
1 5 appreciated that fusion proteins can also facilitate the expression of proteins, and accordingly, 
can be used in the expression of the vertebrate hh polypeptides of the present invention. For 
exjomplty hedgehog polypeptides can be generated as glutathione-S-transferase (GST-fiision) 
proteins. Such GST**iusion proteins can enable easy purification of the hedgehog 
polypq>tide, as for exan^le by the use of glutatfaxone-derivadzed matrices (see, for example, 
20 Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 
1991)). In another embodiment, a fusion gene coding for a purification leader sequence, such 
as a poly-(His)/enterokinase cleavage site sequence, can be used to replace the signal 
sequence which naturally occurs at the N-terminus of the hh protein (e.g., of the pro-form, in 
order to permit purification of the poly(His)-M protein by affinity chromatography using a 
25 Ni2+ metal resm. The purification leader sequence can thra be subsequently removed by 
treatment with enterokinase (e.g., see Hochuli et al, (1987) J. Chromatogrcphy 411:177; and 
Janknecht et al. PNAS 88:8972). 

Techniques for making fusion genes are known to those skilled in the art. Essentially, 
the joining of various DNA fiagments coding for different polypeptide sequences is 

30 performed in accordance with conventional techniques, employing blunt-ended or stagger- 
ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized 
by conventional techniques including automated DNA synthesizers. Alternatively, PGR 

35 amplification of gene firagments can. be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene firagments which can subsequently 
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be annealed to generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology^ eds. Ausubel et al. John Wiley & Sons: 1992). 

Hedgehog polypeptides may also be chemically modified to create hh derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
5 groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of hedgehog 
proteins can be prepared by linking the chemical moieties to functional groups on amino acid 
sidechains of the protein or at the N-terminus or at the C-terminus of the polypeptide. 

For instance, hedgehog proteins can be generated to include a moiety, other than 
sequence naturally associated with the protein, that binds a component of the extracellular 

10 matrix and enhances localization of the analog to cell surfaces. For example, sequences 
derived from the fibronectin "type-Hi repeat", such as a tetr^eptide sequence R-G-D-S 
(Pierschbacher et al. (1984) Natia^e 309:30-3; and Komblihtt et al. (1985) EMBO 4:1755-9) 
can be added to the hh polypeptide to support attachment of the chimeric molecule to a cell 
throi^ binding ECM components (Ruoslahti et al. (1987) Science 238:491-497; 

15 Kerschbacheret al. (1987) J. Biol Chem. 262:17294-8.; Hynes (1987) Cell 48:549-54; and 
Hynes (1992) Ce// 69:1 1-25). 

The present invention also makes available isolated hedgehog polypeptides which are 
isolated from, or otherwise substantially free of other cellular and extracellular proteins, 
especially morphogenic proteins or other extracellular or cell surface associated proteins 

20 which may normally be associated with the hedgehog polypeptide. Hie term "substantially 
free of other cellular or extracellular proteins" (also referred to herein as "contaminating 
proteins") or "substantially pure or purified preparations" are defined as encompassing 
preparations of hh polypeptides having less than 20% (by dry weight) contaminating protein, 
and preferably having less than 5% contaminating protein. Fimctional forms of the subject 

25 polypeptides can be prepared, for the first time, as purified preparations by using a cloned 
gene as described herein. By "purified", it is meant, when referring to a peptide or DNA or 
RNA sequence, that the indicated molecule is present in the substantial absence of other 
biological macromolecules, such as other proteins. The term "purified" as used herein 
preferably means at least 80% by dry weight, more preferably in the range of 95-99% by 

30 weight, and most preferably at least 99.8% by weight, of biological macromolecules of the 
same type present (but water, buffers, and other small molecules, especially molecules 
having a molecular weight of less than 5000, can be present). The term "pure" as used herein 
preferably has the same numerical limits as "purified" immediately above. "Isolated" and 
"purified" do not encompass either natural materials in their native state or natural materials 

35 that have been separated mto components (e.g., in an acrylamide gel) but not obtained either 
as pure (e.g. lacking contaminating proteins, or chromatography reagents such as denaturing 
agents and polymers, e.g. acrylamide or agarose) substances or solutions. In preferred 
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embodiments, purified hedgehog preparations will lack any contaminating proteins from the 
same animal from that hedgehog is normally produced, as can be accomplished by 
recombinant expression of, for example, a human hedgehog protein in a non-human cell. 

As described above for recombinant polypeptides, isolated hh polypeptides can 
5 include all or a portion of the amino acid sequences represented in SEQ ID No:8, SEQ ID 
No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13 or SEQ ID No:14, or a 
homologous sequence thereto. Preferred fragments of the subject hedgehog proteins 
correspond to the N^tenninal and C-terminal proteolytic fragments of the mature protein (see, 
for instance. Examples 6 and 9). 

10 Isolated peptidyl portions of hedgehog proteins can be obtained by screening peptides 

recombinantly produced fix)m the corresponding fragment of the nucleic acid encoding such 
peptides. In addition, fragments can be chemically synthesized using techniques known in 
. the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a 
hedgehog polypeptide of the present invention may be arbitrarily divided into fragments of 

15 desired length with no overlap of the fragments, or preferably divided into overlapping 
fragments of a desired length. The fragments can be produced (recombinantly or by chemical 
synthesis) and tested to identify those peptidyl fragments which can function as either 
agonists or antagonists of a wild-type (e.g., "authentic") hedgehog protein. 

The recombinant hedgehog polypeptides of the present invention also include 
20 homologs of the authentic hedgehog proteins, such as visions of those protein which are 
resistant to proteolytic cleavage, as for example, due to mutations which alter potential 
cleavage sequences or which inactivate an en2ymatic activity associated with the protein. 
Hedgehog homologs of the present invention also include proteins which have been post- 
translationally modified in a manner difierent than the authentic protein. Exemplary 
25 derivatives of vertebrate hedgehog proteins include polypeptides which lack N-glycosylation 
sites (e.g. to produce an imglycosylated protein), or which lack N-terrainal and/or C-terminal 
sequences. 

Modification of the structure of the subject vertebrate hh polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic eflBcacy, or stability (e.g., ex vivo shelf life 
30 and resistance to proteolytic degradation in vivo). Such modified peptides, when designed to 
retain at least one activity of the naturally-occurring form of the protein, are considered 
functional equivalents of the hedgehog polypeptides described in more detail herein. Such 
modified peptides can be produced, for instance, by amino acid substitution, deletion, or 
addition. 

35 For example, it is reasonable to expect that an isolated replacement of a leucine v^tfa 

an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
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replacement of an amino acid with a structurally related amino acid (i.e. isosteric and/or 
isoelectric mutations) will not have a major effect on the biological activity of the resulting 
molecule. Conservative replacements are those that take place within a family of amino acids 
that are related in their side chains. Genetically encoded amino acids are can be divided into 
5 four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) 

nonpolar = alanine, valine, .leucine, isoleucine, proline, phenylalanine, methionine, * 
tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly 
as aromatic amino acids. In similar fashion, the amino acid repertoire can be grouped as (1) 

10 acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = glycine, 
alanine, valine, leucine^ isoleucine, serine, threonine, with serine and threonine optionally be 
grouped separately as aliphadc-hydroxyl; (4) aromatic = phenylalanine, tyrosine, tryptophan; 
(5) amide asparagine, glutamine; and (6) stdfiir -containing = cysteine and methionine, 
(see, for example. Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 1981). 

15 Whether a change in the amino acid sequence of a peptide results in a functional hedgehog 
homolog (e.g. functional in the sense that it acts to mimic or antagonize the wild-type form) 
can be readily determined by assessing the ability of the variant peptide to produce a response 
in cells in a fiaishion similar to the wild-type protein, or competitively inhibit such a response. 
Polypeptides in which more than one replacement has taken place can readily be tested in the 

20 same manner. 

This invention further contemplates a method for generating sets of combinatorial 
mutants of the subject hedgehog proteins as well as truncation mutants, and is especially 
useful for identifying potential variant sequences (e.g. homoiogs) that are functional in 
binding to a receptor for hedgehog proteins. The purpose of screening such combinatorial 

25 libraries is to generate, for example, novel hh homoiogs v^ch can act as either agonists or 
antagonist, or alternatively, possess novel activities all together. To illustrate, hedgehog 
homoiogs can be engineered by tiie present method to provide more efScient binding to a 
cognate receptor, yet still retain at least a portion of an activity associated with hh. Thus, 
combinatorially-derived homoiogs can be generated to have an increased potency relative to a 

30 naturally occurring form of the protein. Likewise, hedgehog homoiogs can be generated by 
the present combinatorial ^proach to act as antagonists, in that they are able to mimic, for 
example, binding to other extracellular matrix components (such as receptors), yet not induce » 
any biological response, thereby inhibiting the action of au&entic hedgehog or hedgehog 
agonists. Moreover, manipulation, of certain domains of hh by the present method can 

35 provide domains more suitable for use in fusion proteins, such as one that incorporates 
portions of other proteins which are derived from the extracellular matrix and/or which bind 
extracellular matrix components. 
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In one aspect of this method, the amino acid sequences for a population of hedgehog 
homologs or other related proteins are aligned, preferably to promote the highest homology 
possible. Such a population of variants can include, for example, hh homologs from one or 
more species. Ammo acids which appear at each position of the aligned sequences are 
selected to create a degenerate set of combinatorial sequences. In a preferred embodiment, 
the variegated library of hedgehog variants is generated by combmatorial mutagenesis at the 
nucleic acid level, and is encoded by a variegated gene library. For instance, a mixture of 
synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the 
degenerate set of potential hh sequences are expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g. for phage display) contaming the set of hh 
sequences therein. 

As illustrated in Figure SA, to analyze the sequences of a population of variants, the 
amino acid sequences of interest can be aligned relative to sequence homology. The presence 
or absence of amino acids from an aligned sequence of a particular variant is relative to a 
chosen consensus length of a reference sequence, which can be real or artificial. In order to 
maintain the highest homology in alignment of sequences, deletions in the sequence of a 
variant relative to the reference sequence can be represented by an amino acid space (• or *), 
^le insertional mutations in the variant relative to the refer^ice sequence can be 
disregarded and left out of the sequence of the variant when aligned. For instance, Figure 5 A 
includes the alignment of several cloned fonns of hh from different species. Analysis of the 
alignment of the hh clones shown in Figure 5 A can give rise to the generation of a degenerate 
library of polypeptides comprising potential hh sequences. 

In an illustrative embodiment, alignment of exon 1/2 encoded sequences (e.g. the N- 
terminal approximately 165 residues of the mature protein) of each of the Shh clones 
produces a degenerate set of Shh polypeptides represented by the general formula: 

C-G-P-G-R-G-X(l) -G -X(2) -R-R-H-P-K-K-L-T-P-L-A-Y-K-rQ-F-I-P-N-V- 
A-E-K-T-L-G-A-S-G-R-Y-E-G-K-I-X{3)-R-N-S-B-R"F-K-E-L-T-P-N-y-N- 
P-D-I-I-F-K-D-B-E-N-T-G-A-D-R-L-M-T-Q-R-C-K-D-K-L-N-X(4) -L-A«-I- 
S-V-M-N-X(5) -W-P-G-V.X{6) -L-R-V-T-E-G-W-D-E-D-G-H-H-X (7) -E-E-S- 
L-H-Y-E-G-R-A-V-D-I-T-T-S-D-R-D-X (8) -S-K-Y-G -X(9) -L-X (10) -R-L- 
A-V-E-A-G-F-D-W-V-Y-Y-E-S-K-A-H-I-H-C-S-V-K-A-E (SEQ ID No: 
40), 

wherein each of the degenerate positions "X" can be an amino acid which occurs in that 
position in one of the human, mouse, chicken or zebrafish Shh clones, or, to expand the 
Ubrary, each X can also be selected from amongst amino acid residue which would be 
conservative substitutions for the amino acids which appear naturally in each of those 
positions. For instance, Xaa(l) represents Gly, Ala, Val, Leu, He, Phe, Tyr or Trp ; Xaa(2) 
represents Arg, His or Lys; Xaa(3) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(4) 
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represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(5) represents Lys, Arg, His, Asn or Gin; 
Xaa(6) represents Lys, Arg or His; Xaa(7) represents Ser, Thr, Tyr, Trp or Phe; Xaa(8) 
represents Lys, Arg or His; Xaa(9) represents Met, Cys, Ser or Thr; and Xaa(10) represents 
Gly, Ala, Val, Leu, He, Ser or Thr. In an even more expansive library, each X can be selected 
from any amino acid 

In similar feshion, alignment of each of the human, mouse, chicken and zebrafish 
hedgehog clones (Figure SB), can provide a degoierate polypeptide sequence represented by 
the general formula: 

C-O-P-G-R-S-X (1) -X (2) -X (3) -R-R-X (4) -X {S) -X (6) -P-K-X (7) -L-X (8) - 
P-L-X(9)-y-K-Q-F-X(10)-P-X{ll)-X(12)-X{13)-E-X(14) -T-L-G-A-S-G- 
X (15) -X (16) -B-G-X (17) -X (18) -X (19) -R-X (20) -S-B-R-F-X (21) -X (22) - 
L-T-P-N-y-N-P-D-I-I-F-K-D-E-E-N -X(23) -G-A-D-R-L-M-T-X (24) -R-C- 
K-X (25) -X (26) -X (27) -N-X (28) -L-A-I-S-V-M-N-X (29) -W-P-G-V-X (30) - 
L-R-V-T-E-G-X(31) -D-E-D-G-H-H-X (32) -X(33)-X(34) -S-L-H-Y-E-G-R- 
A-X (35) -D- I-T-T-S-D-R-D-X (36) -X (37) -K-Y-G-X (38) -L-X (39) -R-L-A- 
V-E-A-G-F-D-W-V-Y-Y-E-S-X(40) -X(41) -H-X(42) -H-X(43) -S-V-K-X(44) 
-X(45) (SEQ ID No: 41) , 

wherein, as above, each of the degenerate positions "X" can be an amino acid which occurs in 
a corresponding position in one of the wild-type clones, and may also include amino acid 
residue >)^*iich would be conservative substitutions, or each X can be any amino acid residue. 
In an exemplary embodiment, Xaa(l) represents Gly, Ala, Val, Leu, He, Pro, Phe or Ty^ 
Xaa(2) represents Gly, Ala, Val, Leu or He; Xaa(3) represents Gly, Ala, Val, Leu, He, Lys, 
His or Arg; Xaa(4) represents Lys, Arg or His; Xaa(5) represents Phe, Trp, Tyr or an amino 
acid gap; Xaa(6) represents Gly, Ala, Val, Leu, He or an amino acid gap; Xaa(7) represents 
Asn, Gin, His, Arg or Lys; Xaa(8) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(9) 
represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(10) represents Gly, Ala, Val, Leu, He, Ser 
or Thr; Xaa(l 1) represents Ser, Thr, Gbi or Asn; Xaa(12) represents Met, Cys, Gly, Ala, Val, 
Leu, He, Ser or Thr; Xaa(13) represents Gly, Ala, Val, Leu, He or Pro; Xaa(14) represents 
Arg, His or Lys; Xaa(15) represents Gly, Ala, Val, Leu, He, Pro, Arg, His or Lys; Xaa(16) 
represents Gly, Ala, Val, Leu, He, Phe or Tyr, Xaa(17) represents Arg, His or Lys; Xaa(18) 
represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(19) represents Thr or Ser; Xaa(20) 
represents Gly, Ala, Val, Leu, He, Asn or Ghu Xaa(21) represents Arg, His or Lys; Xaa(22) 
represents Asp or Glu; Xaa(23) represents Ser or Thr, Xaa(24) lepresoits Glu, Asp, Gki or 
Asn; Xaa(25) represents Glu or Asp; Xaa(26) r^resents Arg, His or Lys; XaaG27) represents 
Gly, Ala, Val, Leu or He; Xaa(28) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(29) 
represents Met, Cys, Ghi, Asn, Arg, Lys or His; Xaa(30) represents Arg, His or Lys; Xaa(31) 
represents Trp, Phe, Tyr, Arg, His or Lys; Xaa(32) represents Gly, Ala, Val, Leu, He, Ser, 
Thr, Tyr or Phe; Xaa(33) represents Gki, Asn, Asp or Glu; Xaa(34) represents Asp or Glu; 
Xaa(35) represents Gly, Ala, Val, Leu, or He; Xaa(36) rq)resents Arg, His or Lys; Xaa(37) 
represents Asn, Ghi, Thr or Ser, Xaa(38) represents Gly, Ala, Val, Leu, He, Ser, Thr, Met or 
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Cys; Xaa(39) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(40) represents Arg, His or 
Lys; Xaa(41) represents Asn, Gin, Gly, Ala, Val, Leu or lie; Xaa(42) represents Gly, Ala, 
Val, Leu or He; Xaa(43) represents Gly, Ala, Val, Leu, lie, Ser, Thr or Cys; Xaa(44) 
represents Gly, Ala, Val, Leu, He, Thr or Ser; and Xaa(45) represents Asp or Glu. 

5 There are many ways by which the library of potential hh homologs can be generated 

from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene 
sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then 
ligated into an appropriate expression vector. The purpose of a degenerate set of genes is to 
provide, in one mixture, ail of the sequences encoding the desired set of potential hh 

10 sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for 
example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, 
Proc 3rd Cleveland Sympos. Macromolecules, ed, AG Walton, Amsterdam: Elsevier pp273- 
289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1 :477. Such techniques have been employed 

15 in the directed evolution of other ]nx>teins (see, for example, Scott et al. (1990) Science 
249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 
404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 
5,198,346, and 5,096,815). 

A wide range of techniques are known in the art for screening gene products of 
20 combinatorial libraries made by point mutations, and for screening cDNA libraries for gene 
products having a certain property. Such techniques will be generally adaptable for rapid 
screening of the gene libraries generated by the combinatorial mutagenesis of hedgehog 
homologs. The most widely used techniques for screening large gene libraries typically 
comprises cloning the gene library into replicable expression vectors, transforming 
25 appropriate cells with the resulting library of vectors, and expressing the combinatorial genes 
under conditions in which detection of a desired activity facilitates relatively easy isolation of 
the vector encoding the gene whose product was detected. Each of the illustrative assays 
described below are amenable to high flurougfa-put analysis as necessary to screen large 
numbers of degmerate hedgehog sequences created by combinatorial mutagenesis 
30 techniques. 

In one embodiment, the combmatorial Ubrary is designed to be secreted (e.g. the 
polypeptides of the library all include a signal sequence but no transmembrane or 
cytoplasmic domains), and is used to transfect a eukaryotic cell that can be co-cultured with 
embryonic cells. A functional hedgehog protem secreted by the cells expressing the 
35 combinatorial library will diffuse to neighboring embryonic cells and induce a particular 
biological response, such as to illustrate, neuronal differentiation. Using antibodies directed 
to epitopes of particular neuronal cells (e.g. Islet^l or Pax-1), the pattern of detection of 
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neuronal induction will resemble a gradient function, and will allow the isolation (generally 
after several repetitive rounds of selection) of cells producing active hedgehog homologs. 
Likewise, hh antagonists can be selected in similar fashion by the ability of the cell producing 
a functional antagonist to protect neighboring cells from the effect of wild-type hedgehog 
5 added to the culture media. 

To illustrate, target cells are cultured in 24-well microtitre plates. Other eukaryotic 
cells are transfected with the combinatorial hh gene library and cultured in cell culture inserts 
(e.g. Collaborative Biomedical Products, Catalog #40446) that are able to fit mto Ae wells of 
the microtitre plate. The cell culture inserts are placed m the wells such that recombinant hh 

10 homologs secreted by the cells in the insert can difiuse through the porous bottom of the 
insert and contact the target cells in the microtitre plate wells. Aft^ a period of time 
sufficient for functional forms of a hedgehog protein to produce a measurable response in the 
target cells, the inserts are removed and the effect of the variant hedgehog proteins on the 
target cells determined. For example, where the target cell is a neural crest cell and the 

IS activity desired from the hh homolog is the induction of neuronal differentiation, then 
fluorescentiy-labeled antibodies specific for islet-1 or other neuronal markers can be used to 
score for mduction in the target cells as indicative of a functional hh in that well. Cells from 
the inserts corresponding to wells which score positive for activity can be split and re- 
cultured on several inserts, the process being repeated tmtil the active clones are identified. 

20 In yet another screening assay, the candidate hedgehog gene products are displayed on 

the sur&ce of a cell or vfral particle, and the ability of particular cells or viral particles to 
associate with a /le^^eAog-bindmg moiety (such as an hedgehog receptor or a ligand which 
binds the hedgehog protein) via this gene product is detected in a "pannii^ assay". Such 
panning steps can be canied out on cells cultured from embryos. For instance, the gene 

25 library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the 
resultijig fusion protein detected by pannmg (Ladner et al„ WO 88/06630; Fuchs et al. (1991) 
Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). In a sunilar 
fashion, fluorescentiy labeled molecules which bind hh can be used to score for potentially 
functional hh homologs. Cells can be visually inspected and separated under a fluorescence 

30 microscope, or, where the morphology of the cell permits, separated by a fluorescence- 
activated cell sorter. 

In an alternate embodiment, the gene library is expressed as a fusion protein on the 
surface of a viral particle. For instance, in the filamentous phage system, foreign peptide 
sequences can be expressed on the surface of infectious phs^e, thereby conferring two 
35 significant benefits. First, since these phage can be applied to a£5nity matrices at very high 
concentrations, large number of phage can be screened at one time. Second, since each 
infectious phage displays the combinatorial gene product on its sur£Eu:e, if a particular phage 
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is recovered from an afiSnity matrix in low yield, the phage can be amplified by another 
round of infection. The group of almost identical Rcoli filamentous phages Ml 3, fd, and fl 
are most often used in phage display libraries, as either of the phage gin or gVIII coat 
proteins can be used to generate fusion proteins without disnq)ting the ultimate packaging of 
5 the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT 
publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffths et 
al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. 
(1992) PNAS 89:4457-4461). 

In an illustrative embodimoit, fhe recombinant phage antibody system (RPAS, 

10 Phaiamacia Catalog number 27-9400-01) can be easily modified for use in expressing and 
screening hh combinatorial libraries. For instance, flie pCANTAB 5 phagemid of the RPAS 
kit contains the gene which encodes the phage gin coat protein. The hh combinatorial gene 
library can be cloned into the phagemid adjacoit to the gin signal sequence such that it will 
be expressed as a gin fiision protein. After ligation, the phagemid is used to transform 

15 competent E. coli TGI cells. Transformed cells are subsequaitly infected with M13K07 
helper phage to rescue the phagemid and its candidate hh gene insert The resulting 
recombinant phage contain phagemid DNA encoding a specific candidate hh, and display one 
or more copies of the corresponding fusion coat protein. The phage-di^layed candidate 
hed%tho% proteins vdiich are capable of bmding an hh receptor are selected or enriched by 

20 panning. For instance, the phage library can be applied to cultured embryonic cells and 
unbound phage washed away from the cells. The bound phage is then isolated, and if the 
recombinant phage express at least one copy of the vwld type gHI coat protein, Aey will retam 
their ability to infect E. coli. Thus, successive rounds of reinfection of £. coli, and panning 
will greatly enrich for hh homologs, which can then be screened for further biological 

25 activities in order to differentiate agonists and antagonists. Moreover, differential panning, 
e.g., with two or more different W?-responsive cells, can facilitate isolation of hedgehog 
homologs of selectively narrower biological activity relative to the wild-type protein. 

The invention also provides for reduction of the vertebrate hh protein to generate 
mimetics, e.g. peptide or non-peptide agents, which are able to disrupt binding of a vertebrate 

30 hh polypeptide of the present invention with an hh receptor. Thus, sudi mutagenic 
techniques as described above are also useful to map the determinants of the hedgehog 
proteins which participate in protein-protein interactions involved in, for example, binding of 
the subject vertebrate hh polypeptide to other extracellular matrix components. To illustrate, 
the critical residues of a subject hh polypeptide or hh ligand which are involved in molecular 

35 recognition of an hh receptor can be determined and used to generate hedgehog-derived 
peptidomimetics which competitively inhibit binding of the authentic hedgehog protein with 
that moiety. By employing, for example, scanning mutagenesis to map the amino acid 
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residues of each of the subject hedgehog proteins which are involved in binding other 
extracellular proteins, peptidomimetic compounds can be generated which mimic those 
residues of the hedgehog protein vAach facilitate the interaction. Such mimetics may then be 
used to interfere with the nonnal fimction of a hedgehog protein. For instance, non- 

5 hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., 
see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huf6nan et al. in Peptides: 
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), 
substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. 

10 Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. m Peptides: Structure and 
Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. 
Rockland, XL, 1985), |J-tum dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and 
Sato et al. (1986) J Chem Soc PerMn Trans 1:1231), and p-aminoalcohols (Gordon et al. 

1 5 (1985) Biochem Biophys Res Communl26Al9; and Dann et al. (1986) Biochem Biophys Res 
Conanun 134:71). 

Another aqject of the invention pertains to an antibody specifically reactive with a 
vertebrate hedgehog protein. For example, by using immunogens derived from hedgehog 
protein, e.g. based on the cDNA sequences, anti-protein/anti-peptide antisera or monoclonal 
20 antibodies can be made by standard protocols (See, for example. Antibodies: A Laboratory 
Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal, such as a 
mouse, a hamster or rabbit can be inununized with an immunogenic form of the peptide (e.g., 
a vertebrate Wi polypeptide or an antigenic fragment which is enable of eliciting an antibody 
response). Techniques for conferring inununogenicity on a protein or peptide include 
25 conjugation to carriers or other techniques well known m the art. An immunogenic portion 
of a hedgehog protein can be administered in the presence of adjuvant. The progress of 
immunization can be monitored by detection of antibody titers in plasma or serum. Standard 
ELISA or other immunoassays can be used wth the immuhogai as antigen to assess flie 
levels of antibodies. In a prefored onbodiment, the subject antibodies are immuno^ecific 
30 for antigenic determinants of a hedgehog protein of a vertebrate onanism, such as a 
mammal, e.g. antigenic determinants of a protein represented by SEQ ID Nos:8-14 or a 
closely related homolog (e.g. at least 85% homologous, prefwably at least 90% homologous, 
and more preferably at least 95% homologous). In yet a further preferred embodiment of the 
present invention, in order to provide, for example, antibodies vMch are immuno-selective 
35 for discrete hedgehog homologs, e.g. Shh versus Dhh versus Ihh, the aa&-hh polypeptide 
antibodies do not substantially cross react (i.e. does not react specifically) with a protem 
vMdti is, for example, less tiian 85% homologous to any of SEQ ID Nos:8-14; e.g., less than 
95% homologous with one of SEQ ID Nos:8-14; e.g., less than 98-99% homologous with 
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one of SEQ ID Nos:8-14. By "not substantially cross react", it is meant that the antibody has 
a binding affinity for a non-homologous protein which is at least one order of magnitude, 
more preferably at least 2 orders of magnitude, and even more preferably at least 3 orders of 
magnitude less than the binding affinity of the antibody for one or more of the proteins of 
SEQIDNos:8-14. 

FoIIovdng immunization of an animal with an antigenic preparation of a hedgehog 
protein, anti-M antisera can be obtained and, if desired, polyclonal anti-M antibodies isolated 
from the senmi. To produce monoclonal antibodies, antibody-producing cells (lymphocytes) 
can be harvested from an immunized animal and fused by standard somatic cell fusion 
procedures with immortalizing cells such as myeloma cells to yield hybridoma cells. Such 
techniques are well known in the art, an include, for example, the hybiidoma technique 
(originally developed by Kohler and Milstein, (1975) Nature, 256: 495-497), the human B 
cell hybridoma technique (JCozbar et al., (1983) Immunology Today, 4: 72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al., (1985) 
Monoclonal Antibodies and Cancer Thertgjy, Alan R. Liss, Inc. pp. 77-96). Hybridoma cells 
can be screened immunochemically for jwoducdon of antibodies specifically reactive with a 
vertebrate hh polypeptide of the present invention and monoclonal antibodies isolated from a 
culture comprismg such hybridoma cells. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with one of the subject vertebrate hh polypeptides. Antibodies can 
be fiagmented using conventional techniques and the fi:agments screened for utility in tiie 
same manner as described above for whole antibodies. For example, F(ab)2 fragments can 
be generated by treating antibody with pepsin. The resuWng F(ab)2 fiBgment can be treated 
to reduce disulfide bridges to produce Fab fragments. The antibody of tiie present invention 
is further intoided to include bispecific and chimeric molecules having affinity for a 
hedgehog protein conferred by at least one CDR region of the antibody. 

Both monoclonal and polycloi»l antibodies (Ab) directed against authentic hedgehog 
polypeptides, or hedgehog variants, and antibody fragments such as Fab and F(ab)2, can be 
used to block the action of one or more hedgehog proteins and allow the study of the role of 
these proteins in, for example, embiyogenesis and/or maintenance of differential tissue. For 
example, purified monoclonal Abs can be injected directiy into the limb buds of cluck or 
mouse embryos. It is demonstrated in the examples below that hh is expressed in the limb 
buds of; for example, day 10.5 embiyos. Thus, the use of aoA-hh Abs during this 
developmental stage can allow assessmrait of the effect of M on the formation of limbs wi 
vivo. In a similar aroroach, hybridomas producing anti-M monoclonal Abs, or 
biodegradable gels m which anti-M Abs are suspaided, can be implanted at a site proxunal 
or withm the area at vMch hh action is intended to be blocked. Ejjperiments of this nature 
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can aid in deciphering the role of this and other factors that may be involved in limb 
patterning and tissue formation. 

Antibodies which specifically bind hedgehog epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 

5 pattern of expression of each of the subject /A polypeptides. An^-hedgehog antibodies can 
be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate 
hedgehog protem levels in tissue as part of a clinical testing procedure. For instance, such 
measurements can be useful in predictive valuations of the onset or progression of 
neurological disorders, such as those marked by denervation-like or disuse-like symptoms. 

10 Likewise, the ability to monitor hh levels in an individual can allow detennination of the 
efficacy of a given treatment r^imen for an individual afflicted with such a (Usorder. The 
level of hh polypeptides may be measured in bodily fluid, such as in sanq>les of cerebral 
spinal fluid or amniotic fluid, or can be measured in tissue, such as produced by biopsy. 
Diagnostic assays using amd-hh antibodies can include, for example, immunoassays designed 

15 to aid in early diagnosis of a neurodegenerative disordw, particularly ones which are manifest 
at birth. Diagnostic assays using anti-AA polypeptide , antibodies can also include 
unmunoassays designed to aid in early diagnosis and phenotyping of a differentiative 
disorder, as well as neoplastic or hyperplastic disorders. 

Another application of m^-hh antibodies of the present invention is in the 
20 immunological screening of cDNA libraries constructed in expression vectors such as Xgtl 1, 
;Lgtl8-23, 3LZAP, and X0RF8. Messenger libraries of this type, having codmg sequences 
inserted in the correct reading frame and orientation, can produce fusion proteins. For 
instance, Xgtl 1 will produce fusion proteins whose amino termini consist of fi-galactosidase 
amino acid sequences and whose carboxy termini consist of a foreign polypeptide. Antigenic 
25 epitopes of an hh protein, e.g. other orthologs of a particular hedgehog protein or other 
homologs from the same species, can then be detected with antibodies, as, for example, 
reacting nitrocellulose filters lifted fi-om infected plates with anti-M antibodies. Positive 
phage detected by this assay can then be isolated from the infected plate. Thus, the presence 
of hedgehog homologs can be detected and cloned firom other animals, as can alternate 
30 isoforms (including splicing variants) fix>m humans. 

Moreover, the nucleotide sequences determined from the cloning of hh genes from 
vertebrate organisms will further allow for the generation of probes and primers designed for 
use in identifying and/or cloning hedgehog homologs in other cell types, e.g. from other 
tissues, as well as hh homologs from other vertebrate organisms. For instance, tiie present 
35 invention also provides a probe/primer comprising a substantially purified oligonucleotide, 
which oligonucleotide comprises a region of nucleotide sequence that hybridizes under 
stringent conditions to at least 10 consecutive nucleotides of sense or anti-sense sequence 
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selected from the group consisting of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID 
No;4, SEQ ID No:5, SEQ ID No:6 and SEQ ID No:7, or naturally occuxxing mutants thereof. 
For instance, primers based on the nucleic acid represented in SEQ ID Nos:l-7 can be used in 
PCR reactions to clone hedgehog homologs. Likewise, probes based on the subject hedgehog 
5 sequences can be used to detect transcripts or genomic sequences encoding the same or 
homologous proteins. In preferred embodiments, the probe further comprises a label group 
attached thereto and able to be detected, e.g. the label group is selected from the group 
consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

Such probes can also be used as a part of a diagnostic test kit for identifying cells or 
10 tissue which misexpress a hedgehog protem, such as by measuring a level of a hedgehog 
encoding nucleic acid in a sample of cells from a patient; e.g. detecting hh mRNA levels or 
determining whether a genomic hh gene has been mutated or deleted. 

To illustrate, nucleotide probes can be generated from the subject hedgehog genes 
which facilitate histological screenii^ of intact tissue and tissue samples for the presence (or 

IS absence) of hedgehog-encoding transcripts. Similar to the diagnostic uses of saaA-hedgehog 
antibodies, the use of probes du^cted to hh messages, or to genomic hh sequences, can be 
used for both predictive and therapeutic evaltiation of allelic mutations which might be 
manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell growth) or 
abnormal differentiation of tissue. Used in conjunction with immunoassays as described 

20 above, the oligonucleotide probes can help facilitate the determination of the molecular basis 
for a developmental disorder which may involve some abnormality associated with 
expression (or lack thereof) of a hedgehog protein. For instance, variation in polypeptide 
synthesis can be differentiated from a mutation in a coding sequence. 

Accordingly, the present method provides a method for determining if a subject is at 
25 risk for a disorder characterized by aberrant control of differentiation or unwanted cell 
proliferation. For instance, the subject assay can be used in the screening and diagnosis of 
genetic and acquired disorders which involve alteration in one or more of die hedgehog 
genes. In preferred embodiments, the subject method can be generally characterized as 
comprising: detecting, in a tissue sample of the subject (e.g. a human patient), the presence 
30 or absence of a genetic lesion characterized by at least one of (i) a mutation of a gene 
encoding a hedgehog protein or (ii) the mis-expression of a hedgehog gene. To illustrate, 
such genetic lesions can be detected by ascertaining the existence of at least one of (i) a 
deletion of one or more nucleotides from a hedgehog gene, (ii) an addition of one or more 
nucleotides to a hedgehog gene, (iii) a substitution of one or more nucleotides of a hedgehog 
35 gene, (iv) a gross chromosomal reairangement of a hedgehog gene, (v) a gross alteration in 
the level of a messenger RNA transcript of an hh gene, (vi) the presence of a non-wild type 
splidng pattem of a messenger RNA transcript of a vertebrate hh gene, and (vii) a non-wild 
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type level of a hedgehog protein. In one aspect of the invention there is provided a 
probe/primer comprising an oligonucleotide containing a region of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence selected from the group 
consisting of SEQ ID Nos:l-7, or naturally occurring mutants thereof, or 5' or 3* flanking 

5 sequences or intronic sequences naturally associated with a vertebrate hh gene. The probe is 
exposed to nucleic acid of a tissue sample; and the hybridization of the probe to the sample 
nucleic acid is detected. In certain embodiments, detection of the lesion comprises utilizing 
the probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent No: 4,683,195 
and 4,683,202) or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et 

10 al. (1988) Science, 241:1077-1080; and NaKazawa et al. (1944) PNAS 91:360-364) the later 
of which can be particularly useful for detecting point mutations in hedgehog genes. 
Alternatively, immunoassays can be employed to determine the level of hh proteins, either 
soluble or membrane bound. 

Yet another diagnostic screen employs a source of hedgehog protein directly. As 
1 5 described herein, hedgehog proteins of the present invention are involved in the induction of 
differentiation. Acconiingly, the pathology of certain difiFerentiative and/or proliferative 
disorders can be marked by loss of hedgehog sensitivity by the afflicted tissue. 
ConsequOTtly, the responise of a tissue or cell sample to an inductive amount of a hedgehog 
protein can be Used to detect and characterize certain cellular transformations and 
20 degenerative conditions. For instance, tissue/cell samples from a patient can be treated with 
a hedgehog agonist and the response of the tissue to the treatment determined. Response 
can be qtialified and/or quantified, for example, on the basis of phenotypic change as result 
of hedgehog induction. For example, expression of gene products induced by hedgehog 
treatment can be scored for by immunoassay. The patched protein, for example, is 
25 upregulated in drosophila in response to Dros-HH, and, in light of the findmgs herein, a 
presumed vertebrate homolog will similarly be \j5)regulated. Thus, detection of patched 
expression on the cells of the patient sample can permit detection of tissue that is not 
hedgehog-responsive. Likewise, scoring for other phenotypic markers provides a means for 
determining the response to hedgehog. 

30 Furthermore, by making available purified and recombinant hedgehog polypeptides, 

the present invention facilitates the development of assays which can be used to screen for 
drugs, including hedgehog homologs, which are either agonists or antagonists of the noraial 
cellular function of the subject hedgehog polypeptides, or of their role m the pathogenesis of 
cellular differentiation and/or proliferation and disorders related thereto. In one embodiment, 

35 the assay evaluates the ability of a compound to modulate binding between a hedgehog 
polypeptide and a hedgehog receptor. A variety of assay formats will suffice and, in light of 
the present inventions, will be comprehended by skilled artisan. 
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In many drug screening programs which test libraries of compoimds and natural 
extracts, high throughput assays are desirable in order to maximize the number of compounds 
surveyed in a given period of time. Assays which are performed in cell-free systems, such as 
may be derived with purified or semi-purified proteins, are often preferred as "primary" 
5 screens in that they can be generated to permit rapid development and relatively easy 
detection of an alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailability of the test compoimd can be 
generally ignored in the in vitro system, the assay instead being focused primarily on the 
effect of the drug on the molecular target as may be manifest in an alteration of binding 

10 affinity with receptor proteins. Accordingly, in an exemplary screening assay of the present 
invention, the compoimd of interest is contacted with a hedgehog receptor polypeptide which 
is ordinarily capable of binding a hedgehog protein. To the mixture of the compound and 
receptor is then added a composition containing a hedgehog polypeptide. Detection and 
quantification of KCQptor/hedgehog complexes provides a means for determining the 

15 compound's efficacy at inhibiting (or potentiating) complex formation between the receptor 
protein and tiie hedgehog polypeptide. The efficacy of the compound can be assessed by 
generating dose response curves from data obtained using various concentrations of the test 
compound. Moreover, a control assay can also be performed to provide a baseline for 
comparison. In tiie control assay, isolated and prm&ed hedgehog polypeptide is added to a 

20 composition containing the recq>tor protein, and the formation of xeceptoi/hedgehog complex 
is quantitated in the absence of tiie test compound. 

In an illustrative embodiment, the polypeptide utilized as a hedgehog receptor can be 
generated from the drosophila patched protein or a vertebrate homolog thereof In light of 
the ability of, for example, Shh to activate Dros-HH pathways in transgenic drosophila (see 

25 Example 4), it may be concluded that vertebrate hedgehog proteins are capable of binding to 
Drosophila HH receptors. Accordingly, an exemplary screening assay includes a suitable 
portion of the patched protein (SEQ ID No. 42), such as one or both of the substantial 
extracellular domains (e.g. residues Lys-93 to His-426 and Arg-700 to Arg-966). For 
instance, the patched protein can be provided in soluble form, as for example a preparation of 

30 one of the extracellular domains, or a preparation of both of the extracellular domains which 
are covalentiy connected by an unstructured linker (see, for example, Huston et al. (1988) 
PNAS 85:4879; and U.S. Patent No. 5,091,513), or can be provided as part of a liposon^ 
preparation or expressed on the surface of a cell. 

Complex formation between the hedgehog polypeptide and a hedgehog receptor may 
35 be detected by a variety of techniques. For instance, modulation of the formation of 
complexes can be quantitated using, for example, detectably labeled proteins such as 
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radiolabelled, fluorescently labeled, or enzymatically labeled hedgehog polypeptides, by 
inununoassay, or by chromatographic detection. 

Typically, it will be desirable to immobilize either the hedgehog receptor or the 
hedgehog polypeptide to &cilitate separation of rec&ptoT/hedgehog complexes ISrom 

5 uncomplexed forms of one of the proteins, as well as to accommodate automation of the 
assay. In one embodiment, a fusion protein can be provided which adds a domain that allows 
the protein to be bound to a matrix. For example, gluta^one-S-tiansferase/!receptor 
(GST/ieceptor) fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione dmvatized microtitre plates, which are then 

1 0 combined with the hedgehog polypeptide, e.g. an ^^S-labeled hedgehog polypeptide, and the 
test compound and incubated under conditions conducive to complex formation, e.g. at 
physiological conditions for salt and pH, though slightly more stringent conditions may be 
desired. Following incubation, the beads are washed to remove any unbound hedgehog 
polypeptide, and the matrix bead-bound radiolabel determined dhectly (e.g. beads placed in 

15 scintillant), or in the siQ)ematant after the rcc&ptor/hedgehog complexes are dissociated. 
Alternatively, the complexes can dissociated from the bead, separated by SDS-PAGE gel, 
and the level of hedgehog polypeptide found in the bead fraction quantitated from the gel 
using standard electrophoretic techniques. 

Other techniques for immobilizing proteins on matrices are also available for use in 

20 &e subject assay. For instance, soluble portions of the hedgehog receptor protein can be 
immobilized utilimig conji^ation of biotin and streptavidin. For instance, biotinylated 
receptor molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit. Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidm-coated 96 well plates (Pierce Chemical). 

25 Alternatively, antibodies reactive with the hedgehog receptor but which do not interfere with 
hedgehog binding can be derivadzed to the wells of the plate, and the iecq)tor trapped in the 
wells by antibody conjugation. As above, preparations of a hedgehog polypeptide and a test 
compound are incubated in the receptor-presenting wells of the plate, and the amount of 
rectptor/hedgehog complex trapped in the well can be quantitated. Exemplary methods for 

30 detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the 
hedgehog polypeptide, or which are reactive with the receptor protein and compete for 
binding with the hedgehog polypeptide; as well as enzyme-linked assays which rely on 
detecting an enzymatic activity associated with the hedgehog polypeptide. In the instance of 

35 the latter, the enzyme can be chemically conjugated or provided as a frision protein with the 
hedgehog polypeptide. To illustrate, the hedgehog polypeptide can be chemically cross- 
linked or genetically fused with alkaline phosphatase, and the amount of hedgehog 
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polypeptide trapped in the complex can be assessed with a chromogenic substrate of the 
enzyme, e.g. paranitrophenylphosphate. Likewise, a fusion protein comprising the hedgehog 
polypeptide and glutathione-S-transferase can be provided, and complex formation 
quantitated by detecting the GST activity using l-chloro-2,4-dinitrobenzene (Habig et al 
5 (1974) J Biol Chem 249:7130), 

For processes vMch rely on immunodetection for quantitating one of the proteins 
trapped in the complex, antibodies against the protein, such as the mix-hedgehog antibodies 
described herein, can be used. Alternatively, the protein to be detected in the complex can be 
"epitope tagged** m the form of a fusion protein which includes, in addition to the hedgehog 

10 polypeptide or hedgehog recqptor sequence, a second polypeptide for which antibodies are 
readily available (e.g, from commercial sources). For instance, die GST fusion proteins 
described above can also be used for quantification of binding using antibodies against the 
GST moiety. Other useful epitope tags include myc-epitopes (e.g., see Ellison et al. (1991) J 
Biol Chem 266:21150-21157) which includes a 10-residue sequence from c*myc, as well as 

15 the pFLAG system (International Biotechnologies, Inc.) or the pEZZ-protein A system 
(Pharamacia, NJ). 

Where the desired portion of the hh receptor (or other hedgehog binding molecule) 
cannot be provided in soluble form, liposomal vesicles can be used to provide manipulatable 
and isolatable sources of the receptor. For example, both authentic and recombinant forms of 
20 &e patched protein can be reconstituted m artificial lipid vesicles (e.g. phosphatidylcholine 
liposomes) or in cell membrane-derived vesicles (see, for example, Bear et al. (1992) Cell 
68:809-818; Newton et al. (1983) Biochemistry 22:61 10-61 17; and Reber et al. (1987) J Biol 
C/icw 262:11369-1 1374). 

In addition to cell-free assays, such as described above, the readily available source of 
vertebrate hedgehog proteins provided by the present invention also facilitates the generation 
of cell-based assays for identifying small molecule agonists/antagonists and the like. 
Analogous to the cell-based assays described above for screening combinatorial libraries, 
cells which are sensitive to hedgehog induction can be contacted with a hedgehog protein and 
a test agent of interest, with the assay scoring for modulation in hedgehog inductive 
responses by the target cell in the presence and absence of the test agent. As with the cell- 
fi:ee assays, agents which produce a statistically significant change in hedgehog activities 
(either inhibition or potentiation) can be identified. In an illustrative embodiment, motor 
neuron progenitor cells, such as from neural plate e^iants, can be used as target cells. 
Treatment of such explanted cells with, for example, Shh causes the cells to differentiate into 
motor neurons. By detecting the co-expression of the LIM homeodomain protein Islet-1 
(Thor et al. (1991) Neuron 7:881-889; Ericson et al. (1992) Science 256:1555-1560) and the 
inununoglobulin-like protein SCI (Tanaka et al. (1984) Dev Biol 106:26-37), the ability of a 
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candidate agent to potentiate or inhibit Shh induction of motor neuron differentiation can be 
measured. The hedgehog protein can be provided as a purified sotirce, or in the form of 
cells/tissue vdiich express the protein and which are co-cultured with the target cells. 

In yet another embodiment, the method of the present invention can be used to isolate 

5 and clone hedgehog receptors. For example, purified hedgehog protems of the present 
invention can be employed to precipitate hedgehog receptor proteins fix)m cell fractions 
prepared from cells ^ch are responsive to a hedgehog protein. For instance, purified 
hedgehog protein can be derivatized with biotm (using, for instance, NHS-Biotin, Pierce 
Chemical catalog no. 21420G), and die biotinylated protein utilized to saturate membrane 

10 bound hh receptors. The hedgehog receptors can subsequently be adsorbed or 

immobilized on streptavidin. If desired, the hedgehog-TGceptor complex can be cross-linked 
with a chemical cross-linking agent In such as manner, hh receptors can be purified, 
preferably to near homogeneity. The isolated hh receptor can then be partially digested vdtfa, 
for example, trypsm, and the resulting peptides separated by reverse-phase chromatography. 

15 The chromatography fragments are then analyzed by Edman degradation to obtain single 
sequences for two or more of the proteolytic firagments. From the chemically determined 
amino acid sequence for each of these tryptic fragments, a set of oligonucleotide primers can 
be designed for PCR. These primers can be used to screen both genomic and cDNA libraries. 
Similar strategies for cloning receptors have bem employed, for example, to obtain the 

20 recombinant gene for somatostatm receptors (Eppler et al. (1992) J Biol Chem 267:15603- 
15612). 

Other techniques for identifying hedgehog receptors by ^ression cloning will be 
evident in light of the present disclosure. For mstance, purified hh polypeptides can be 
immobilized in wells of micro titre plates and contacted with, for example, COS cells 

25 transfected witii a cDNA library (e.g., fix>m tissue expected to be responsive to hedgehog 
induction). From this panning assay, celb ^ch express hedgehog receptor molecules can 
be isolated on the basis of binding to the immobilized hedgehog protein. Another cloning 
system, described in PCT publications WO 92/06220 of Flanagan and Leder, involves the use 
of an expression cloning system whereby a hedgehog receptor is stored on the basis of 

30 binding to a hedgehog/^Sk3liat phosphatase fusion protein (see also Cheng et al. (1994) Cell 
79:157-168) 

Another aspect of the present invention relates to a method of inducing and/or 
maintaining a differentiated state, enhancing survival, and/or promoting proliferation of a cell 
responsive to a vertebrate hedgehog protein, by contacting the cells with an hh agonist or an 
35 hh antagonist as the circumstances may warrant For instance, it is contemplated by tiie 
invention that, in light of the present finding of an apparentiy broad involven^ent of hedgehog 
proteins m the formation of ordered spatial arrangements of differentiated tissues in 
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vertebrates, the subject method could be used to generate and/or msArstsAn an array of 
different vertebrate tissue both in vitro and in vivo. The hh agent, \Aether inductive or anti- 
inductive, can be, as appropriate, any of the preparations described above, including isolated 
polypeptides, gene therapy constructs, antisense molecules, peptidomimetics or agents 
identified in the drug assays provided herein. Moreover, it is contemplated that, based on the 
observation of activity of the vertebrate hedgehog proteins in Drosophila, hh agents, for 
purposes of therapeutic and diagnostic uses, can include the Dros-HH protein and homologs 
thereof. Moreover, the source of hedgehog protem can be, in addition to purified protem or 
recombinant cells, cells or tissue explants which naturally produce one or more hedgehog 
proteins. For instance, as described in Example 2, neural tube e)q)lants from embryos, 
particularly floorplate tissue, can provide a source for Shh polypeptide, which soxirce can be 
implanted in a patient or otherwise provided, as appropriate, for induction or maintenance of 
differentiation. 

For example, the present meAod is applicable to cell culture techniques. In vitro 
neuronal culture systems have proved to be fundamental and indispensable tools for the study 
of neural development, as well as the identification of neurotrophic &ctors such as nerve 
growth factor (NGF), ciliary trophic factors (CNTF), and brain derived neurotrophic factor 
(BDNF). Once a neuronal cell has become terminally-diflFerentiated it typically will not 
change to another terminally differentiated cell-type. However, neuronal cells can 
nevertheless readily lose their differentiated state. This is commonly observed when they are 
grown in culture from adult tissue, and when they form a blastema during regeneration. The 
present method provides a means for ensuring an adequately restrictive environment in order 
to maintain neuronal cells at various stages of differentiation, and can be employed, for 
instance, in cell cultures designed to test tiie specific activities of other trophic factors. In 
such embodiments of the subject method, the cultured cells can be contacted with an hh 
polypeptide, or an agent identified in the assays described above, in order to induce neuronal 
differentiation (e.g. of a stem cell), or to maintain the integrity of a culture of terminally- 
differentiated neuronal cells by preventing loss of differentiation. The source of hedgehog 
protein in the culture can be derived from, for example, a purified or semi-purified protein 
composition added directiy to the cell culture media, or alternatively, supported and/or 
released from a polymeric device which supports the growth of various neuronal cells and 
which has been doped with the protem. The source of the hedgehog protein can also be a 
cell that is co-cultured with the intended neuronal cell and which produces a recombinantor 
wild-type hedgehog protein. Alternatively, the source can be the neuronal cell itself which 
has been engmeered to produce a recombinant hedgehog protein. In an exemplary 
embodiment, a naive neuronal cell (e,g. a stem cell) is treated with an hh agonist in order to 
mduce differentiation of tiie cells into, for example, sensory neurons or, alternatively, 
motomeurons. Such neuronal cultures can be used as convenient assay systems as well as 
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souxces of implantable cells for therapeutic treatments. For example, hh polypeptides may be 
useful in establishing and maintaining the olfactory neuron cultures described in U.S. Patent 
5,318,907 and the like. 

According to the present invention, large numbers of non-tumorigenic neural 
5 progenitor cells can be perpetuated in vitro and induced to differentiate by contact with 
hedgehog proteins. Generally, a method is provided comprising the steps of isolating neural 
progeoitor cells from an animal, perpetuating these cells in vitro or in vivo, preferably in the 
presence of growth factors, and differentiating these cells into particular neural phenotypes, 
e.g., neurons and glia, by contacting the cells with a hedgehog agonist 

10 Progenitor cells are thought to be under a tonic inhibitory influence which maintains 

the progenitors in a suppressed state until their differentiation is required. However, recent 
techniques have been provided which permit these cells to be proliferated, and unlike neurons 
which are terminally differentiated and therefore non-dividing, they can be produced in 
unlhnited nxmiber and are highly suitable for transplantation into heterologous and 

IS autologous hosts with neurodegenerative diseases. 

By '^progenitor** it is meant an oligopotent or multipotent stem cell which is able to 
divide without limit and, under spedfic conditions, can produce daughter cells which 
terminally differentiate such as mto neurons and glia. These cells can be used for 
transplantation into a heterologous or autologous host By heterologous is meant a host other 
20 than the animal from which the progenitor cells were originally derived. By autologous is 
meant the identical host from which the cells were originally derived. 

Cells can be obtained from embryonic, post-natal, juvexule or adult neural tissue from 
any animal. By any animal is meant any multicellular animal which contains nervous tissue. 
More particularly, is meant any fish, reptile, burd, amphibian or mammal and the like. The 
25 most preferable donors are mammals, especially mice and humans. 

In the case of a heterologous 4onor animal, the animal may be euthanized, and the 
brain and specific area of interest removed using a sterile procedure. Brain areas of particular 
interest include any area fix)m which progenitor cells can be obtained which will serve to 
restore function to a degenerated area of tiie host's laain. These regions mclude areas of the 

30 central nervous system (CNS) including the cerebral cortex, cerebellum, midbrain, brainstem, 
spinal cord and ventricular tissue, and areas of the peripheral liervous system (PNS) including 
the carotid body and the adrenal medulla. More particularly, these areas include regions in the 
basal ganglia, preferably the striatum which consists of the caudate and putamen, or various 
cell groups such as the globus pallidus, the subthalamic nucleus, the niicleus basalis which is 

35 found to be degenerated in Alzheimer's Disease patients, or the substantia nigra pars 
compacta v^ch is found to be degenerated in Parkinson's Disease patients. 
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Human heterologous neural progenitor cells may be derived from fetal tissue obtained 
from elective abortion, or from a post-natal, juvenile or adult organ donor. Autologous neural 
tissue can be obtained by biopsy, or from patients undergoing neurosurgery in which neural 
tissue is removed, in particular during epilepsy surgery, and more particularly during 
5 temporal lobectomies and hippocampalectomies. 

Cells can be obtained from donor tissue by dissociation of individual cells from the 
connecting e?ctracellular matrix of the tissue. Dissociation can be obtained using any known 
jm>cedure, including treatment with enzymes such as trypsin, coUagenase and the like, or by 
using physical methods of dissociation such as with a blimt instrument Dissociation of fetal 
10 cells can be carried out in tissue culture medium, viiile a preferable medium for dissociation 
of juvenile and adult cells is artificial cerebral spinal fluid (aCSF). Regular aCSF contains 
124 mM NaCl, 5 mM KCl, 1.3 mM MgCl2, 2 mM CaCl2, 26 mM NaHCOs, and 10 mM D- 
glucose. Low Ca^~^ aCSF contains the same ingredients except for M gCl2 at a concentration 
of 3.2 mM and CaCl2 at a concentration of 0.1 mM. 

1 S Dissociated cells can be placed into any known culture medium capable of supporting 

cell growtii, including MEM, DMEM, RPMI, F-12, and the like, containing supplements 
\^ch are required for cellidar metabolism such as glutamine and other amino acids, 
vitamins, minerals and useful proteins such as transferrin and the like. Medium may also 
contain antibiotics to prevent contamination with yeast, bacteria and fungi such as penicillin, 

20 streptomycin, gentamicin and the like. In some cases, the medium may contain serum 
derived from bovine, equine, chicken and the like. A particularly preferable medium for cells 
is a mixture of DMEM and F-12. 

Conditions for culturing should be close to physiological conditions. The pH of the 
culture media should be close to physiological pH, preferably between pH 6-8, more 
25 preferably close to pH 7, even more particularly about pH 7.4. Cells should be cultured at a 
temperature close to physiological temperature, preferably between 30®C-40®C, more 
preferably between 32**C-38°C, and most preferably between 35*'C-37*'C. 

Cells can be grown in suspension or on a fbced substrate, but proliferation of the 
progenitors is preferably done in suiq>ension to generate large numbers of cells by formation 

30 of "neurospheres" (see, for example, Reynolds et al. (1992) Science 255:1070-1709; and PCT 
Publications WO93/01275, WO94/09119, WO94/10292, and W094/16718). In tiie case of 
propagating (or splitting) suspension cells, flasks are shaken well and the neurospheres 
allowed to settle on the bottom comer of the flask. The spheres are then transferred to a 50 
ml centrifuge tube and centrifuged at low speed. The medium is aspirated, the cells 

35 resuspended in a small amount of mediimi with growth factor, and the cells mechanically 
dissociated and resuspended in separate aliquots of media. 
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Cell suspensions in culture medium are supplemented with any growth factor which 
allows for the proliferation of progenitor cells and seeded in any receptacle capable of 
sustaining cells, though as set out above, preferably in culture flasks or roller bottles. Cells 
typically proliferate within 3-4 days in a 3TC incubator, and proliferation can be reinitiated 
S at any time after that by dissociation of the cells and resuspension in fresh medium containing 
growth factors. 

In the absence of substrate, cells lift off the floor of the flask and continue to 
proliferate in suspension forming a hollow sphere of undifferentiated cells. After 
approximately 3-10 days in vitro, the proliferating clusters (neurospheres) are fed every 2-7 
10 days, and more particularly every 2-4 days by gentle centrifiigation and resuspension in 
medium containing growth factor. 

After 6-7 days in vitro, individual cells in the neurospheres can be separated by 
physical dissociation of the neurospheres with a blunt instrument, more particularly by 
triturating the neurospheres with a pipette. Single cells from the dissociated neurospheres are 
15 suspended in culture medium containing growth factors, and differentiation of the cells can 
be induced by plating (or resuspending) the cells in the presence of a hedgehog agonist, and 
(optionally) any other factor capable of sustaining differentiation, such as bFGF and the like. 

To further illustrate other uses of hedgehog agonists and antagonists, it is noted that 
intracerebral grafting has emerged as an additional approach to central nervous system 

20 therapies. For example, one approach to repairing damaged brain tissues involves the 
transplantation of cells from fetal or neonatal animals into the adult brain (E>unnett et al. 
(1987) J Exp Biol 123:265-289; and Freund et al. (1985) J Neurosci 5:603-616). Fetal 
neurons from a variety of brain regions can be successfully incorporated into the adult brain, 
and such grafts can alleviate behavioral defects. For example, movement disorder induced by 

25 lesions of dopaminergic projections to the basal ganglia can be prevented by grafts of 
embryonic dopaminergic neurons. Complex cognitive functions that are impaired after 
lesions of the neocortex can also be partially restored by grafts of embryonic cortical cells. 
The use of hedgehog proteins or mimetics, such as Shh or DWi, in the culture can prevent 
loss of (Ufferentiation, or where fetal tissue is used, especially neuronal stem cells, can be 

30 used to induce differentiation. 

Stem cells useful in the present invention are generally known. For example, several 
neural crest cells have been identified, some of which are multipotent and likely represent 
uncommitted neural crest cells, and others of which can generate only one type of cell, such 
as sensory neurons, and likely represent committed progenitor cells. The role of hedgehog 
35 proteins employed in the present method to culture such stem cells can be to induce 
differentiation of the xmconraiitted progenitor and thereby give rise to a conmiitted progenitor 
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cell, or to cause further restriction of the developmental fate of a committed progenitor cell 
towards becoming a terminally-differentiated neuronal cell. For example, the present method 
can be used in vitro to induce and/or maintain the differentiation of neural crest cells into 
glial cells, schwaim cells, chromaffin cells, cholinergic sympathetic or parasympathetic 
5 neurons, as well as peptidergic and serotonergic neurons. The hedgehog protein can be used 
alone, or can be used in combination with other neurotrophic factors which act to more 
particularly enhance a particular differentiation fate of the neuronal progenitor cell. In the 
later instance, an hh polypeptide might be viewed as ensuring that the treated cell has 
achieved a particular phenotypic state such that the cell is poised along a certain 
10 developmental pathway so as to be properly induced upon contact with a secondary 
neurotrophic factor. In similar fashion, even relatively undifferentiated stem cells or 
primitive neuroblasts can be maintained in culture and caused to differentiate by treatment 
witii hedgehog agonists. Exemplary primitive cell cultures comprise cells harvested from the 
neural plate or neural tube of an embryo even before much overt differentiation has occurred. 

IS In addition to the implantation of cells cultured in die presence of a functional 

hedgehog activity and other in vitro uses described above, yet another aspect of the present 
invention concerns the therapeutic application of a hedgehog protein or mimetic to enhance 
survival of neurons and other neuronal cells in both the central nervous system and the 
peripheral nervous system. The ability of hedgehog protein to regulate neuronal 

20 differentiation during development of the nervous system and also presumably in the adult 
state indicates that certain of the hedgehog proteins can be reasonably expected to facilitate 
control of adult neurons with regard to maintenance, functional performance, and aging of 
normal cells; repair and regeneration processes in chemically or mechanically lesioned cells; 
and prevention of degeneration and premature death v^iiich resuU from loss of differentiation 

25 in certain pathological conditions. In light of this understanding, the present invention 
specifically contemplates applications of the subject method to the treatment of (prevention 
and/or reduction of the severity of) neurological conditions deriving from: (i) acute, 
subacute, or chronic injury to the nenrous system, including traumatic injury, chemical 
injury, vasal injury and deficits (such as the ischemia resulting firom stroke), together with 

30 infectious/inflanmiatory and tumor-induced injury; (ii) aging of the nervous system 
including Alzheimer's disease; (iii) chronic neurodegenerative diseases of the nervous 
system, mcluding Parkinson's disease, Huntington's chorea, amylotrophic lateral sclerosis and 
the like, as well as spinocerebellar degenerations; and (iv) chronic immunological diseases of 
the nervous system or affecting the nervous system, including multiple sclerosis. 

35 Many neurological disorders are associated with degeneration of discrete populations 

of neuronal elements and may be treatable with a therapeutic regimen which includes a 
hedgehog agonist. For example, Alzheimer's disease is associated with deficits in several 
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neurotransmitter systems, both those that project to the neocortex and those that reside with 
the cortex. For instance, the nucleus basalis in patients with Alzheimer's disease have been 
observed to have a profound (75%) loss of neurons compared to age-matched controls. 
Although Alzheimer's disease is by far the most common foim of dementia, several other 
5 disorders can produce dementia. Several of these are degenerative diseases characterized by 
the death of nexirons in various parts of the central nervous system, especially the cerebral 
cortex. However, some forms of dementia are associated with degeneration of the thalmus or 
the white matter underlying the cerebral cortex. Here, the cognitive dysfunction results from 
the isolation of cortical areas by the degeneration of efferents and afiferents. Huntington's 

10 disease involves the degeneration of intrastraital and cortical cholinergic neurons and 
GABAergic neurons. Pick's disease is a severe neuronal degeneration in the neocortex of the 
frontal and anterior temporal lobes, sometimes accompanied by death of neurons in the 
striatum. Treatment of patients suffering from such degenerative conditions can include the 
application of hedgehog polypeptides, or agents which mimic their effects, in order to 

15 control, for example, diffmntiation and apoptotic events vMch give rise to loss of neurons 
(e.g. to enhance survival of ^sting neurons) as well as promote differentiation and 
repopulation by progenitor cells in the area affected. In preferred embodiments, a source of a 
hedgehog agent is stereotactically provided within or proximate the area of degeneratioiL In 
addition to degenerative-induced dementias, a pharmaceutical prq)aration of one or more of 

20 the subject hedgehog proteins can be epphed opportunely in the treatment of 
neurodegenerative disorders vMch have manifestations of tremors and involuntary 
movements. Parkinson's disease, for example, primarily affects subcortical structures and is 
. characterized by degeneration of tiie nigrostriatal pathway, raphe nuclei, locus cereleus, and 
the motor nucleus of vagus. Ballism is typically associated with damage to tiie subthahnic 

25 nucleus, often due to acute vascular accident. Also included are neurogenic and myopadiic 
diseases which ultimately affect the somatic division of the peripheral nervous system and are 
manifest as neuromuscular disorders. Examples include chronic atrophies such as 
amyotrophic lateral sclerosis, Guillain-Barre syndrome and chronic peripheral neuropathy, as 
well as other diseases which can be manifest as progressive bulbar palsies or spinal muscular 

30 atrophies. The present method is amenable to the treatment of disorders of the cerebellum 
which result in hypotonia or ataxia, such as those lesions in the cerebellum which produce 
disorders in the limbs ipsilateral to the lesion. For instance, a preparation of a hedgehog 
homolog can be used to treat a restricted form of cerebellar cortical degeneration involving 
the anterior lobes (vermis and leg areas) such as is common in alcoholic patients. 

35 In an illustrative embodiment, the subject method is used to treat amyotrophic lateral 

sclerosis. ALS is a name given to a complex of disorders that comprise upper and lower 
motor neurons. Patients may present with progressive spinal muscular atrophy, progressive 
bulbar palsy, primary lateral sclerosis, or a combination of these conditions. The major 
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pathological abnomality is charactaized by a selective and progressive degeneration of the 
lower motor neurons in the spinal cord and the upper motor neurons in the cerebral cortex. 
The therapeutic application of a hedgehog agonist, particularly Dhh, can be used alone, or in 
conjunction with other neurotrophic factors such as CNTF, BDNF or NGF to prevent and/or 
5 reverse motor neuron degeneration in ALS patients. 

Hedgehog proteins of the present invention can also be used in the treatment of 
autonomic disorders of tiie peripheral nervous system, v«tich include disorders affecting the 
innervation of smootii muscle and endocrine tissue (such as glandular tissue). For instance, 
the subject method can be used to treat tachycardia or atrial cardiac arrytfamias which may 
10 arise finm a degenerative condition of the nerves innervating tiie striated muscle of the heart. 

Furthermore, a potential role for certain of the hedgehog protems, v\iiich is apparent 
from the appended examples, mainly the data of respecting hedgehog expression in sensory 
and motor neurons of the head and trunk (including limb buds), concerns the role of 
hedgehog proteins in development and maintenance of dendritic processes of axonal neurons. 

15 Potential roles for hedgehog proteins consequently include gmdance for axonal projections 
and the ability to promote differentiation and/or maintenance of the innervating cells to their 
axonal processes. Accordingly, compositions comprising hedgehog agonists or other 
hedgehog agents described herein, may be employed to support, or alternatively antagonize 
the survival and reprojection of several types of ganglionic neurons sympathetic and sensory 

20 neurons as well as motorneurons. In particular, such tiieri9)eutic conq)oations may be useful 
in treatments designed to rescue, for example, various neurons from lesion-induced death as 
well as guiding reprojection of fliese neurons after sudi damage. Such disease include, but 
are not linuted to, CNS trauma in&rction, infection (such as viral infection witii varicella- 
zoster), metabolic disease, nutritional deficiency, toxic agents (such as cisplatin treatment). 

25 Moreover, certain of the hedgehog agoits (such as antagonistic form) may be useful in the 
selective ablation of sensory neurons, for example, in the treatment of chronic pain 
syndromes. 

As appropriate, hedgehog agents can be used in nerve prostheses for the repair of 
central and peripheral nerve damage. In particular, where a crushed or severed axon is 

30 intubulated by use of a prostfietic device, hedgehog polypeptides can be added to tiie 
prostiietic device to increase the rate of growth and regeneration of tiie dendridic processes. 
Exemplary nerve guidance channels are described in U.S. patents 5,092,871 and 4,955,892. 
Accordingly, a severed axonal process can be directed toward tiie nerve ending from which it 
was severed by a prostiiesis nerve guide vAdch contains, e.g. a semi-solid formulation 

35 containing hedgehog polypeptide or mimetic, or ^ch is derivatized along tiie inner walls 
with a hedgehog protein. 
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In another embodiment, the subject method can be used in the treatment of neoplastic 
or hyperplastic transformations such as may occur in the central nervous system. For 
instance, certain of the hedgehog proteins (or hh agonists) which induce differentiation of 
neuronal cells can be utilized to cause such transformed cells to become either post-mitotic or 
5 apoptotic. Treatment with a hedgehog agent may facilitate disruption of autocrine loops, 
such as TGF-P or PDGF autostimulatory loops, which are believed to be involved in the 
neoplastic transformation of several neuronal tumors. Hedgehog agonists may, therefore, be 
of use in the treatment of, for example, malignant gliomas, medulloblastomas, 
neuroectodermal tumors, and ependymonas. 

1 0 Yet ano&er a^)ect of the present invention concerns the application of the discovery 

that hedgehog proteins are morphogmic signals involved in other vertebrate organogenic 
pathways in addition to neuronal differentiation as described above, having apparent roles in 
other endodermal patterning, as well as both mesodermal and endodermal differentiation 
processes. As described in the Examples below, Shh clearly plays a role in proper limb 

1 5 growth and patterning by initiating expression of signaling molecules, including Bmp-2 in the 
mesoderm and Fgf-4 in the ectoderm. Thus, it is contemplated by the invention that 
compositions comprising hedgehog proteins can also be utilized for both cell culture and 
therapeutic methods involving generation and maintenance of non-neuional tissue. 

In one embodiment, the present invention makes use of the discovery that hedgehog 
20 proteins, such as Shh, are appaxeaAy involved in controlling the development of stem cells 
responsible for formation of the digestive tract, liver, lungs, and other organs which derive 
from the primitive gut. As described in the Examples below, Shh serves as an inductive 
signal from the endodemi to the mesoderm, which is critical to gut morphogenesis. 
Therefore, for example, hedgehog agonists can be employed in the development and 
25 maintenance of an artificial liver viiich can have multiple metabolic functions of a normal 
liver. In an exemplary embodiment, hedgehog agonists can be used to induce differentiation 
of digestive tube stem cells to form hepatocyte cultures which can be used to populate 
extracellular matrices, or which can be encapsulated in biocompatible polymers, to form both 
implantable and extracorporeal artificial livers. 

30 In another embodiment, therapeutic compositions of hedgehog agonists can be 

utilized in conjunction with transplantation of such artificial livers, as well as embiyonic liver 
stractures, to promote inlx^eritoneal implantation, vascularization, and in vivo differentiation 
and maintenance of the engrafted liver tissue. 

In yet another embodiment, hedgehog agonists can be employed therapeutically to 
35 regulate such organs after physical, chemical or pathological insult For instance, therapeutic 
compositions comprising hedgehog agonists can be utilized in liver repair subsequent to a 
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partial hepatectomy. Similarly, therapeutic compositions containing hedgehog agonists can 
be used to promote regeneration of lung tissue in the treatment of emphysema. 

In still another embodiment of the present invention, compositions comprising 
hedgehog agonists can be used in the in vitro generation of skeletal tissue, such as from 
skeletogenic stem cells, as well as the in vivo treatment of skeletal tissue deficiencies. The 
present invention particularly contemplates the use of hedgehog agonists vAdch maintain a 
skeletogenic activity, such as an ability to induce chondrogenesis and/or osteogenesis. By 
"skeletal tissue deficiency", it is meant a deficiency in bone or other skeletal connective tissue 
at any site ^ere it is desired to restore the bone or connective tissue, no matter how the 
deficiency originated, e.g. vdiether as a result of surgical intervention, removal of tumor, 
ulceration, implant, firacture, or other traumatic or degenerative conditions. 

For instance, the present invention makes available efiective therapeutic methods and 
compositions for restoring cartilage function to a connective tissue. Such methods are useful 
in, for example, the repair of defects or lesions in cartilage tissue which is the result of 
degenerative wear such as that which results in arthritis, as well as other mechanical 
derangements which may be caused by trauma to the tissue, sudi as a displacement of torn 
meniscus tissue, meniscectomy, a laxation of a joint by a tom ligament, malignment of joints, 
bone firacture, or by hereditary disease. The present reparative method is also useful for 
remodeling cartilage matrix, such as in plastic or reconstructive surgery, as well as 
periodontal surgery. The present meAod may also be applied to improving a previous 
reparative procedure, for example, following surgical rq>air of a meniscus, ligament, or 
cartilage. Furthermore, it may prevent the onset or exacerbation of degenerative disease if 
applied early ^ough after trauma. 

In one embodiment of the present invention, the subject method comprises treating 
the afflicted connective tissue with a therapeutically sufficient amount of sl hedgehog ^onist, 
particularly an Ihh agonist, to generate a cartilage repair response in the connective tissue by 
stimulating the differentiation and/or proliferation of chondrocytes embedded in the tissue. 
Induction of chondrocytes by treatment with a hedgehog agonist can subsequently result in 
the synthesis of new cartilage matrix by the treated cells. Such connective tissues as articular 
cartilage, interarticular cartilage (menisci), costal cartilage (connecting the trae ribs and the 
sternum), ligaments, and tendons are particularly amenable to treatment in reconstructive 
and/or regenerative ther^ies using the sul:gect method. As used herein, regenerative 
therapies include treatment of degenerative states vMch have progressed to the point of 
which impairment of the tissue is obviously manifest, as well as preventive treatments of 
tissue where degeneration is in its earliest stages or imminent. The subject method can 
further be used to prevent the spread of mineralisation into fibrotic tissue by maintaining a 
constant production of new cartilage. 
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In an illustrative embodiment, the subject method can be used to treat cartilage of a 
diartbroidal joint, such as a knee, an ankle, an elbow, a hip, a wrist, a knuckle of either a 
finger or toe, or a tanperomandibular joint The treatment can be directed to the meniscus of 
the joint, to the articular cartilage of the joint, or both. To further illustrate, the. subject 

5 method can be used to treat a degenerative disorder of a knee, such as which might be the 
result of traumatic injury (e.g., a sports injury or excessive wear) or osteoarthritis. An 
injection of & hedgehog agonist into the joint with, for instance, an arthroscopic needle, can 
be used to treat the afOicted cartilage. In some instances, the injected agent can be in the 
form of a hydrogel or other slow release vehicle described above in order to peraiit a more 

10 extended and regular contact of the agent with the treated tissue. 

The present invention further contemplates 4e use of the subject method in the field 
of cartUage transplantation and prosthetic device therapies. To date, the growth of new 
cartilage fiom either transplantation of autologous or allogenic cartilage has been largely 
unsuccessfiil. Problems arise, for mslance, because the characteristics of cartilage and 

15 fibrocartilage varies between different tissue: such as between articular, meniscal cartilage, 
Ugaments, and tendons, between the two ends of the same Ugament or tendon, and between 
tiie superficial and deep parts of the tissue. The zonal ananganent of these tissues may 
reflect a gradual change in mechanical properties, and Mure occurs when implanted tissue, 
which has not differentiated under those conditions, lacks the ability to appropriately respond. 

20 For instance, vihai meniscal cartilage is used to repair anterior cruciate ligaments, the tissue 
undergoes a metiq)lasia to pure fibrous tissue. By promoting chondrogenesis, tiie subject 
method can be used to particularly addresses this problem, by causmg tiie implanted cells to 
become more adaptive to tiie new environment and effectively resemble hypertrophic 
chondrocytes of an earlier developmental stage of tiie tissue. Thus, tiie action of 

25 chondrogensis in tiie implanted tissue, as provided by tiie subject metiiod, and tiie mechanical 
forces on tiie actively remodeling tissue can synergize to produce an improved implant more 
suitable for the new fimction to which it is to be put. 

In similar feshion, the subject metiiod can be applied to enhancing botii tiie generation 
of prostiietic cartilage devices and to tiieir implantation. The need for improved treatment 

30 has motivated research aimed at creating new cartilage tiiat is based on collagen- 
glycosaminoglycan templates (Stone et al. (1990) Clin Orthop Relat Red 252:129), isolated 
chondrocytes (Grande et al. (1989) J Orthop Res 7:208; and Takigawa et al. (1987) Bone 
Miner 2:449), and chondrocytes attached to natural or syntiietic polymers (Walitani et al. 
(1989) J Bone Jt Surg 71B:74; Vacanti et al. (1991) Plast Recomtr Surg 88:753; von 

35 Schroeder et al. (1991) J Biomed Mater Res 25:329; Freed et al. (1993) J Biomed Mater Res 
27:11; and tiie Vacanti et al. U.S. Patent No. 5,041,138). For example, chondrocytes can be 
grown in culture on biodegradable, biocompatible highly porous scafifolds form d from 
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polymers such as polyglycolic acid, polylactic acid, agarose gel, or other polymers which 
degrade over time as function of hydrolysis of the polymier backbone into innocuous 
monomers. The matrices are designed to allow adequate nutrient and gas exchange to the 
cells until engraftment occurs. The cells can be cultured in vitro until adequate cell volume 
5 and density has developed for the cells to be implanted* One advantage of the matrices is that 
they can be cast or molded into a desired shape on an individual basis, so that the final 
product closely resembles the patient's own ear or nose (by way of example), or flexible 
matrices can be used which allow for manipulation at the time of implantation, as in a joint 

In one embodiment of the subject method, the implants are contacted with a hedgehog 
10 agonist during the culturing process, such z& m Ihh agonist, in order to induce and/or 
maintain di£ferentiated chondrocytes in the culture in order as to further stimulate cartilage 
matrix production within fhe implant In such a manner, the cultured cells can be caused to 
maintain a phenotype typical of a chondrogenic cell (Le. hypertrophic), and hrace continue 
the population of tiie matrix and production of cartUage tissue. 

15 In another embodiment, the implanted device is treated with a hedgehog agonist in 

order to actively remodel the implanted matrix and to make it more suitable for its intended 
function. As set out above with respect to tissue tran^lants, the artificial transplants suffer 
firom tiie same deficiency of not being derived in a setting which is comparable to the actual 
mechanical environment in \^ch the matrix is implanted. The activation of the 

20 chondrocytes in the matrix by the subject method can allow the implant to acquire 
characteristics similar to the tissue for which it is intended to replace. 

In yet another embodiment, the subject method is used to enhance attachment of 
prosthetic devices. To illustrate, the subject method can be used in the unplantation of a 
periodontal prosthesis, wherein the treatment of the surroxmding connective tissue stimulates 
25 formation of periodontal ligament about the prosthesis, as well as inhibits formation of 
fibrotic tissue proximate the prosthetic device. 

In still further embodiments, the subject method can be employed for the generation 
of bone (osteogenesis) at a site in the animal where such skeletal tissue is deficient. Indian 
hedgehog is particularly associated with fte hyp^trophic chondrocytes that are ultimately 

30 replaced by osteoblasts. For instance, administration of a hedgehog agent of the' present 
invention can be employed as part of a method for treating bone loss in a subject, e.g. to 
prevent and/or reverse osteoporosis and other osteopenic disorders, as well as to regulate 
bone growth and maturation. For example, preparations comprising hedgehog agonists can 
be employed, for example, to induce endochondral ossification, at least so far as to facilitate 

35 the formation of cartilaginous tissue precursors to form the "model" for ossification. 
Therapeutic compositions of hedgehog agonists can be supplemented, if required, with other 
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osteoinductive factors, such as bone growth factors (e.g. TGF-P factors, such as the bone 
morphogenetic factors BMP-2 and as well as activin), and may also include, or be 

administered in combination with, an inhibitor of bone resorption such as estrogen, 
bisphosphonate, sodium fluoride, calcitonin, or tamoxifen, or related compounds. However, 
S it will be ^^reciated that hedgehog proteins, such as Ihh and Shh are likely to be upstream of 
BMPs, e.g. hh treatment will have the advantage of initiatmg endogenous expression of 
BMPs along with other &ctors. 

In yet another embodiment of the present invention, a hedgehog antagonist can be 
used to inhibit spermatogenesis. Thus, in light of the present finding that hedgehog proteins 

10 are involved in tiie diffmntiation and/or proliferation and maintenance of testicular germ 
cells, hedgehog antagonist can be utilized to block &e action of a naturally-occurring 
hedgehog protein. In a preferred embodiment, the hedgehog antagonist inhibits the biological 
activity of Dhh with respect to spemiatogenesis, by competitively binding hedgehog 
receptors in the testis. In similar fashion, hedgehog agonists and antagonists are potentially 

1 5 useful for modulating normal ovarian function. 

The souxce of hedgehog polypeptides, whether for cell culture or for in vivo 
application, can be in the form of a purified protein composition, or can eb from a cell 
expressing eiUbsr a recombinant or endogenous form of the polypeptide, such as onbiyonic 
tissue (e.g., floor plate tissue explants). Moreover, is addition to those forms of the vertebrate 
20 hedgehog polypeptides described hmm, the present invention further contemplates the use of 
the drosophila hedgehog (Dros-HH) protein to induce cells and tissue of vertebrate 
organisms. 

In the instance of protein compositions, the hedgehog protein, or a pharmaceutically 
acceptable salt thereof, may be convenientiy formulated for administration with a biologically 

25 acceptable medium, such as water, buffered saline, polyol (for example, glycerol, propylene 
glycol, liquid polyethylene glycol and the like) or suitable mixtures thereof. The optimum 
concentration of the active ingredient(s) in tiie chosen medium can be determined 
empirically, according to procedures well known to medicinal chemists. As used herein, 
"biologically acceptable medium" includes any and all solvents, dispersion media, and the 

30 like which may be q)propriate for the desued route of administration of the phamoaceutical 
preparation. The use of such media for pharmaceutically active substances is known in the 
art. ExcqTt insofar as any conventional media or agent is incompatible with the activity of 
the hedgehog protein, its use in the phamiaceutical preparation of the invention is 
contemplated. Suitable vehicles and then: formulation inclusive of other proteins are 

35 described, for example, in the book Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences. Mack Pid>lishing Company, Easton, Pa., USA 1985). These 
vehicles include injectable "dq>osit formulations". Based on the above, such pharmaceutical 
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fonnulations include, although not exclusively, solutions or fi:eeze-dried powders of a 
hedgehog homolog (such as a Shh, Dhh or Kfhh) in association with one or more 
phannaceutically acceptable vehicles or diluents, and contained in buffered media at a 
suitable pH and isosmotic with physiological fluids. For illustrative purposes only and 
without being limited by the same, possible compositions or fommlations which may be 
prepared in the form of solutions for the treatment of nervous system disorders with a 
hedgehog protein are given in U.S. Patent No. 5,218,094. In the case of freeze-dried 
preparations, supporting excipients such as, but not exclusively, mannitol or glycine may be 
used and appropriate buffered solutions of the desired volmne will be provided so as to obtain 
adequate isotonic buffered solutions of the desired pH. Similar solutions may also be used 
for the pharmaceutical compositions of hh in isotonic solutions of the desired volimie and 
include, but not exclusively, tiie use of buffered saline solutions with phosphate or citrate at 
suitable concentrations so as to obtain at all times isotonic pharmaceutical preparations of the 
desired pH, (for example, neutral pH). 

Methods of introduction of exogenous hh at the site of treatment include, but are not 
limited to, intradermal, intramuscular, intn^ritoneal, intravenous, subcutaneous, oral, 
intranasal and topical. In addition, it may be desirable to introduce tiie pharmaceutical 
compositions of the invention into the central nervous system by any suitable route, including 
intraventricular and intrathecal injection. Intraventricular injection may be fiicilitated by an 
intraventricular catheter, for example, attached to a reservoir, such as an Onunaya reservoir. 

Methods of introduction may also be provided by rechargeable or biodegradable 
devices. Various slow release polymeric devices have been developed and tested in vivo in 
recent years for the controlled delivery of drugs, including proteinacious biopharmaceuticals. 
A variety of biocompatible polymers (including hydrogels), including both biodegradable and 
non-degradable polymers, can be used to form an implant for the sustained release of an M at 
a particular target site. Such embodunents of the present invention can be used for the 
delivery of an exogenously purified hedgehog protein, which has been incorporated in the 
polymeric device, or for the delivery of hedgehog produced by a cell encapsulated in the 
polymeric device. 

An essential feature of certain embodiments of the implant can be the linear release of 
the hh^ which can be achieved through the manipulation of the polymer composition and 
form. By choice of monomer composition or polymerization technique, the amount of water, 
porosity and consequent permeability characteristics can be controlled. The selection of the 
shape, size, polymer, and method for implantationi can be determined on an individual basis 
according to the disorder to be treated and the individual patient response. The generation of 
such implante is generally known in the art. See, for example. Concise Encyclopedia of 
Medical & Dental Materials^ ed. by David Williams (MIT Press: Cambridge, MA, 1990); 
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and the Sabel et al. U.S. Patent No. 4,883,666. In another embodiment of an implant, a 
source of cells producing a hedgehog protein, or a solution of hydogel matrix containing 
purified hh, is encapsulated in implantable hollow fibers. Such fibers can be pre-spun and 
subsequently loaded with the hedgehog source (Aebischer et al. U.S. Patent No. 4,892,538; 

5 Aebischer et al. U.S. Patent No. 5,106,627; Hoffinan et al. (1990) Expt. Neurobiol. 1 10:39- 
44; Jaeger et al. (1990) Prog Brain Res. 82:41-46; and Aebischer et al. (1991) J. Biomech 
Eng. 113:178-183), or can be co-extruded with a polymer which acts to form a polymeric 
coat about the hh source (Lim U.S. Patent No. 4,391,909; Sefton U.S. Patent No. 4,353,888; 
Sugamori et al. (1989) Trans. Am. ArHf. Intern. Organs 35:791-799; Sefton et al. (1987) 

10 Biotehnol Bioeng 29:1 135-1 143; and Aebischer et al. (1991) Biomaterials 12:50-55). 

In yet anotiier embodiment of the present invoition, the phannaceutical hedgehog 
protein can be administered as part of a combinatorial thers^ with other agents. For 
example, the combinatorial therapy can include a hedgehog protein with at least one trophic 
£u:tor. Exemplary trophic fectors include nerve growth fector, cilliary neurotrophic growfli 
15 fector, schwanoma-derived growth fector, glial growth factor, stiatal-detived neuronotrophic 
factor, platelet-derived growth fector, and scatter fector (HGF-SF). Antimitogenic agents can 
also be used, for example, when {ooliferation of surrounding glial cells or astrocytes is 
undesirable in Has regeneration of nerve cells. Examples of such antimitotic agents include 
cytosine, aialnnoside, 5-fluorouracil, hydroxyurea, and mediotrexate. 

20 Another aspect of the invention features transgenic non-human animals which exptess 

a heterologous hedgehog gene of the present invention, or \*ich have had one or more 
genomic hedgehog genes disnqjted in at least one of the tissue or cell-types of the animal. 
Accordingly, the invention features an animal model for developmental diseases, which 
animal has hedgehog allele which is mis-expressed. For example, a ntouse can be bred vMch 

25 has one or more /»A alleles deleted or odierwise rendered inactive. Such a mouse model can 
then be used to study disorders arising finm mis-aq»essed hedgehog genes, as well as for 
evaluating potmtial ther£q)ies for similar disorders. 

Another aspect of the present invention concerns transgenic animals which are 
comprised of cells (of that animal) which contain a transgene of the present invention and 

30 which preferably (though optionally) express an exogenous hedgehog protem in one or more 
cells in the animal. A hedgehog transgene can encode the wild-type form of the protein, or 
can encode homologs thereof, including bofli agonists and antagonists, as well as antisense 
constructs. In preferred embodiments, the expression of the transgene is restricted to specific 
subsets of cells, tissues or develoimiental stages utilizing, for example, cis-acting sequMces 

35 that control expression in the desired pattern. In the present invention, such mosaic 
expression of a hedgehog protem can be essential for many forms of lineage analysis and can 
additionally provide a means to assess the effects of; for example, lack of hedgehog 



95/18856 PCrAJSM/14992 

expression which might grossly alter development in small patches of tissue within an 
otherwise normal embryo. Toward this end, tissue-specific regulatory sequences and 
conditional regulatory sequences can be used to control expression of the transgene in certain 
spatial patterns. Moreover, temporal patterns of expression can be provided by, for example, 
5 conditional recombination systems or prokaryotic transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be regulated via 
site-specific genetic manipulation in vivo are known to those skilled in the art For instance, 
genetic systems are available which allow for the regulated expression of a recombinase that 
catalyzes the genetic recombination a target sequence. As used herein, the phrase "target 

10 sequence** refers to a nucleotide sequence that is genetically recombined by a recombinase. 
The target sequence is flanked by recombinase recognition sequences and is generally either 
excised or inverted in cells expressing recombinase activity. Recombinase catalyzed 
recombination events can be designed such that recombination of tiie terget sequence results 
in either the activation or repression of expression of one of the subject hedgehog proteins. 

15 For example, excision of a target sequence which interferes with tiie expression of a 
recombinant hh gene, such as one which encodes an antagonistic homolog or an antisense 
transcript, can be designed to activate e:}q)ression of that gene. This interfmnce with 
es^ression of the protein can result fiom a variety of mechanisnis, such as spatial separation 
of the hh gene from the promoter element or an internal stop codon. Moreover, the transgene 

20 can be made wherein the coding sequence of the gene is flanked by recombinase recognition 
sequences and is initially transfected into cells in a 3' to 5* orientation with respect to the 
promoter element. In such an instance, inversion of the target sequence will reorient the 
subject gene by placing the 5* end of the coding sequence in an orientation with respect to the 
promoter element which allow for promoter driven transcriptional activation. 

25 In an illustrative embodiment, either the crelloxP recombinase system of 

bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Qrban et al. (1992) PNAS 
89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in 
vivo site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific 

30 recombination of an intervening target sequence located between loxP sequences. JoxP 
sequences are 34 base pair nucleotide repeat sequences to which tiie Cre recombinase binds 
and are required for Cre recombinase mediated genetic recombination. The orientation of 
loxP sequences determines whether the intervemng target sequence is excised or inverted 
when Cre recombinase is present (Abremski et al. (1984) J. Biol Chem. 259:1509-1514); 

35 catalyzing the excision of the target sequence when the loxP sequences are oriented as direct 
repeats and catalyzes inversion of the target sequence vdaen loxP sequences are oriented as 
inverted repeats. 
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Accordingly, genetic recombination of the target sequence is dependent on e>q)ression 
of the Cie recombinase. E}q)ression of the recombinase can be regulated by promoter 
elements which are subject to regulatory control, e.g., tissue-specific, developmental 
stage-specific, inducible or repressible by externally added agents. This regulated control 
will result in genetic recombination of flie target sequence only in cells where recombinase 
expression is mediated by the promoter element. Thus, the activation expression of a 
recombinant hedgehog protem can be regulated via control of recombinase expression. 

Use of the crefloxP recombinase system to regulate expression of a recombinant hh 
protein requires the construction of a transgenic animal containing transgenes encoding both 
the Cre recombinase and the subject protein. Animals containing both the Cre recombinase 
and a recombinant hedgehog gene can be provided through the construction of "double" 
transgenic animals. A convenient method for providing such animals is to mate two 
transgenic animals each containing a transgene, e.g., an hh gene and recombinase gene. 

One advantage derived fi-om initially constructing transgenic animals containing a 
hedgehog transgene in a recombinase-mediated expressible fonnat derives firom the 
likelihood that the subject protein, whether agonistic or antagonistic, can be deleterious upon 
expression in die transgenic animal. In such an instance, a founder population, in vAdch the 
subject transgene is sUent in all tissues, can be propagated and maintained. Individuals of 
this founder population can be crossed wifli animals expressing die recombinase in, for 
example, one or more tissues and/or a deared temporal pattern. Thus, ibs creation of a 
founder population in which, for example, an antagonistic hh transgene is silent will allow 
the study of progeny ftom that founder in which disruption of hedgehog mediated induction 
in a particular tissue or at certain developmental stages would result in, for exaiiq>le, a letiwl 
I^ienotype. 

Similar conditional transgenes can be provided using prokaryotic promoter sequences 
which require prokaryotic proteins to be simultaneous expressed in order to fecilitate 
expression of the hedgehog transgene. Exemplary promoters and the corresponding trans- 
activating prokaryotic proteins are given in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by gene therapy- 
like mediods whefein a gene encoding the trans-activating protein, e.g. a recombinase or a 
prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a cell- 
type specific manner. By tiiis metiiod, a hedgehog transgene could remain silent into 
adulthood until "turned on" by the introduction of the trans-activator. 

In an exemplary embodiment, tiie "transgaiic non-human animals" of the invention 
are produced by introducing transgenes into the germline of tiie non-human animal. 
Embryonic target cells at various developmental stages can be used to introduce transgenes. 
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Different methods are used depending on the stage of development of the embiyonic target 
cell The zygote is the best target for micro-injection. In the mouse, the male pronucleus 
reaches the size of approximately 20 micrometers in diameter which allows reproducible 
injection of l-2pl of DNA solution. The use of zygotes as a target for gene transfer has a 
5 major advantage in that in most cases the injected DNA will be incorporated into the host 
gene before the first cleavage (Brinster et al. (1985) PNAS 82:4438-4442). As a consequence, 
all cells of the transgenic non-himian animal will carry the incorporated transgene. This will 
in general also be reflected in the efficient transmission of the transgene to offspring of the 
founder since 50% of the germ cells will harbor the transgene. Microinjection of :^gotes is 
1 0 the preferred method for incorporating transgenes in practicing the invention. 

Retroviral infection can also be used to introduce hedgehog transgenes into a non- 
human animal. The developing non*buman embryo can be cultured in vitro to the blastocyst 
stage. During fhis time, the blastomeres can be targets for retroviral infection (Jaenich, R. 
(1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic 

15 treatment to remove the zona pellucida {Manipulating the Mouse Embryo, Hogan eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector ^stem used to 
introduce the transgene is typically a replication-defective retrovirus carrying the transgene 
(Jahner et al. (1985) PNAS 82:6927-6931; Van der Putten et al. (1985) PNAS 82:6148^152). 
Transfection is easily and efficiently obtained by culturing the blastomeres on a monolayer of 

20 virus-producmg cells (Van der Putten, supra; Stewart et al. (1987) EMBO J. 6:383-388). 
Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can 
be injected into the blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the 
foimders will be mosaic for the transgene since incorporation occurs only in a subset of the 
cells which formed the transgenic non-human animal. Further, the founder may contain 

25 various retroviral insertions of the transgene at different positions in the genome which 
generally will segregate in the offspring. In addition, it is also possible to introduce 
transgenes into the germ line by intrauterine retroviral infection of the midgestation embryo 
(Jahner et al. (1 982) supra), 

A third type of target cell for transgene introduction is the embryonic stem cell (ES). 
30 ES cells are obtained from pre-implantation embryos cultured in vitro and fissed with 

embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255-258; 

Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 322:445-448). 

Transgenes can be efficiently introduced into the ES cells by DNA transfection or by 

retrovirus-mediated transduction. Such transformed ES cells can thereafter be combined with 
35 blastocysts from a non-human animal. The ES cells thereafter colonize the embryo and 

contribute to the germ line of the resulting chimeric animal. For review see Jaenisch, R. 

(1988) Science 240:1468-1474. 
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Methods of making hedgehog knock-out or disruption transgenic animals are also 
generally known. See, for example, Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent knockouts can 
also be generated, e.g. by homologoxis recombination to insert recombinase target sequences 
5 flanking portions of an endogenous hh gene, such that tissue specific and/or temporal control 
of inactivation of a hedgehog allele can be controlled as above. 

Exen^Ufication 

The invention, now being generally described, will be more readily understood by 
1 0 reference to the following examples, which are included merely for purposes of illustration of 
certain aspects and embodiments of the present invention and are not intended to limit the 
invention. 

Exmpkl 

15 Cloning and Expression of Chick Some Hedgehog 

(i) Experimental Procedures 

Using degenerate PCR primere, vHH50 (SEQ ID No:18), vHH30 (SEQ ID No:19) 
and vHH3I (SEQ ID No:20) corresponding to a sequence conserved between Drosophila 
hedgehog (DrosWiiSEQ ID No:34) (Lee, J.J. et al. (1992) Cell 71: 33-50; Mohler, J. et al., 

20 (1992) Development 115: 957-971) and mouse Indian hedgehog (Ihh) (SEQ ID No:10), a 220 
base pair (bp) fragment was amplified from chicken genomic DNA. From 15 isolates, two 
distinct sequences were cloned, pCHA (SEQ ID No:35) and pCHB (SEQ ID No:36), each 
highly homologous to mouse Ihh (Figure 1). A probe made from isolate pCHA did not 
detect expression in embryonic tissues. Isolate pCHB, however, detected a 4 kb message in 

25 KT^A prepared from embryonic head, trunk, or limb bud RNA. This cloned PCR fragment 
was therefore used as a probe to screen an unamplified cDNA library prepared from 
Hamburger Hamilton stage 22 (Hamburger, W. et al., (1951) J. Morph 88: 49-92) limb bud 
RNA as described below. 

A single 1.6 kilobase (kb) cDNA clone, pHH-2, was selected for characterization and 
30 was used in all subsequent analyses. The gene encoding for this cDNA was named Sonic 
Hedgehog (after the Sega computer game cartoon character). Sequencing of the entire cDNA 
confirmed the presence of a single long open reading frame potentially encoding for a protein 
of 425 amino acids (aa). The clone extends 220 bp upstream of the predicted initiator 
methionine and approximately 70 bp beyond the stop codon. No consensus polyadenylation 
35 signal could be identified in the 3' untranslated region. A second potential initiator 
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methionine occurs at amino acid residue 4, The putative translation initiation signals 
surrounding hoth methionines are predicted to be equally efficient (Kozak, M., (1987) Nuc 
Acids Res, 15: 8125-8132), When the pHH-2 Sonic cDNA is used to probe a northern blot of 
stage 24 embryonic chick RNA, a single mRNA species of approximately 4 kb is detected in 
5 both limb and trunk tissue. The message size was predicted by comparing it to the position 
of 18S and 28S ribosomal RNA. Hybridized mRNA was visualized after a two day exposure 
to a phosphoscreen. Because the Sonic cDNA clone pHH-2 is only 1.6 kb, it is likely to be 
missing approximately 2.4 kb of untranslated sequence. 

PCR Cloning 

10 All standard cloning techniques were performed according to Ausubel et. al. (1989), 

and all enzymes were obtained &om Boehringer Mannheim Biochemicals. Degenerate 
oligonucleotides corresponding to amino acid residues 161 to 237 of the Drosophila 
hedgehog pTOtQin (SEQ ID No:34) (Lee, J J. et. al., (1992) Cell 71: 33-50) were synthesized. 
These degenerate oligonucleotides, vHH50 (SEQ ID No: 18), vHH30 (SEQ ID No: 19), and 

15 vHH3I (SEQ ID No:20) also contained Eco RI, Cla I, and Xba I sites, respectively, on their 5* 
ends to facilitate subcloning. The nucleotide sequence of these oligos is given below: 

VHH50: 5^GGAATTCCCAG(CA)GITG(CI0AA(AG)GA(AG)(CA)(AG)I(GCT)IA^ 
VHH30: 5'-TCATCGATGGACCCA(GA)TC(GA)AAICCIGC(TC)TC-3' 
20 VHH31: 5'-GCTCTAGAGCTCIACIGCIA(GA)IC(GT)IGC-3' 

where I represents inosine. Nested PCR was performed by first amplifying chicken genomic 
DNA using the vHH50 and vHHSO primer pair and then further amplifying that product 
using the vHHSO and vHH3I primer pair. In each case the reaction conditions were: initial 
denaturation at 93** C for 2.5 min., followed by 30 cycles of 94** C for 45 s, 50** C for 1 min., 
25 72** C for 1, and a final incubation of 72** C for 5 min. The 220 bp PCR product was 
subcloned into pGEM7zf (Promega). Two unique clones, pCHA (SEQ ID No:35) and pCHB 
(SEQ ID No:36) were identified. 

DNA Sequence Analysis 

Nucleotide sequences were determined by the dideoxy chain termination method 
30 (Sanger, F. et al., (1977) Proa Natl Acad Sci, USA 74: 5463-5467) using Sequenase v2.0 T7 
DNA polymerase (US Biochemicals). 5* and 3' nested deletions of pHH-2 were generated by 
using the nucleases Exo III and SI (Erase a Base, Promega) and individual subclones 
sequenced. DNA and amino acid sequences were analyzed using both GCG (Devereux, J. et 
al., (1984) Nuc. Acids Res. 12: 387-394) and DNAstar software. Searches for related 
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sequences were done through the BLAST network service (Altschul, S.F. et al., (1990) Jl 
Mol Biol 215: 403-410) provided by the National Center for Biotechnology Information. 

Southern Blot Analysis 

Five (5) ^g of chick genomic DNA was digested with Eco RI and/or Bam HI, 
5 fractionated on a 1% agarose gel, and transferred to a nylon membrane (Genescreen, New 
England Nuclear). The filters were probed with 32p.iabeled hhsi or hhh at 42^C in 
hybridization buffer (0.5% BSA, 500 mM NaHP04, 7% SDS, 1 mM EDTA, pH 7.2; Church, 
G.M. et al., (1984) Proc. Natl Acad Sci USA 81: 1991-1995). The blots were washed at 63** 
C once in 0.5% bovine serum albumin, 50 mM NaHP04 (pH 7.2), 5% SDS, 1 mM EDTA 
10 and twice in 40 mM NaHP04 (pH 7.2), 1% SDS, ImM EDTA, and visualized on Kodak 
XAR-5fitai. 

Isolation Of Chicken Sonic cDNA Clones 

A stage 22 limb bud cDNA library was constructed in XgtlO using Eco RI/NotI 
linkers. Unamplified phage plaques (10^) were transferred to nylon filters (Colony/Plaque 

1 5 screen, NEN) and screened wifli a^^P-labelled pooled inserts from PCR clones pCHA (SEQ 
ED No:3S) and pCHB (SEQ ID No:36). Hybridization was performed at 42"" C in 50% 
fonnamide 2X SSC» 10% dextran sul&te, 1% SDS and washing as described in the Southern 
Blot procedure. Eight positive plaques were identified, purified and their cDNA inserts 
excised with EcoRI and subcloned into pBiuescript SK+ (Stratagene). All eight had 

20 approximately L7 kb inserts with identical restriction patterns. One, pHH-2, was chosen for 
sequencing and used in all further manipulations. 

Preparation Of Digoxigenin-Laheled Riboprobes 

Piasmid pHH-2 was linearized with Hind m and transcribed witii T3 RNA 
polymerase (for antisense probes) or with Bam HI and transcribed with T7 RNA polymerase 
25 according to the manufacturers instructions for the preparation of non-radioactive 
digoxigenin transcripts. Following the transcription reaction, RNA was precipitated, and 
resuspended in RN Ase-fiee water. 

Whole Mount In Situ Hybridization 

Whole-mount in situ hybridization was performed , using protocols modified fi'om 
30 Parr, B.A. et al,. (1993) Development 119: 247-261; Sasaki, H. et al, (1993) Development 
118: 47-59; Rosen, B. et al. (1993) Trends Genet 9: 162-167. Embryos fi^m incubated 
fertile White Leghorn eggs (Spafas) were removed from the egg and extra-embryonic 
membranes dissected in calcium/magnesiiun-free phosphate-buffered saline (PBS) at room 
temperature. Unless otherwise noted, all washes are for five minutes at room temperature. 
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Embtyos were fixed overnight at 4''C with 4% paraformaldehyde in PBS, washed twice with 
PBT (PBS with 0.1% Tween-20) at 4*'C, and dehydrated through an ascending methanol 
series in PBT (25%, 50%, 75%, 2 X 100% methanol). Embryos were stored at -20^C until 
further use. 

Both pre-limb bud and limb bud stage embryos were rehydrated through an 
descending methanol series followed by two washes in PBT. Limb bud stage onbryos were 
bleached in 6% hydrogen peroxide in PBT, washed three times with PBT, permeabilized with 
proteinase K (Boehringer, 2 )xg/ml) for 15 minutes, washed with 2 mg/ml glycine in PBT for 
10 minutes, and twice witii PBT. Pre-limb bud stage embryos were peimealibized (without 
prior incubation wifh hydrogen peroxide) by three 30 minute washes in RIPA bufifer (150 
mM NaCl, 1% NP-40, 0.5% Deoxycholate, 0.1% SDS, ImM EDTA, 50 mM Tris-HCl, pH 
8.0). In all subsequent steps, pre-limb bud and limb bud stage embryos were treated 
equivalently. Embryos were fixed with 4% paraformaldehyde/0.2% gluteraldehyde in PBT, 
washed four times with PBT, once with pre-hybridization buffer (50% formamide, 5 X SSC, 
1% SDS, 50 ^g/ml total yeast RNA, 50 p.g/ml heparin, pH 4.5), and mcubated with firesh pre- 
hybridizadon buffer for one hour at 70^C. The pre-hylnidization bufifer was then replaced 
with hybridization bufifer (pre-hybridization buffer with digoxigenin labeled riboprobe at I 
^g/ml) and incubated overnight at 70X. 

Following hybridization, embryos were washed 3 X 30 minutes at 70**C with solution 
1 (50% formamide, 5 X SSC, 1% SDS, pH 4.5), 3 X 30 mmutes at 70°G with solution 3 
(50% formamide, 2 X SSC, pH 4.5), and three times at room temperature with TBS (Tris- 
buffered saline with 2 mM levamisole) containing 0.1% Tween-20. Non-specific binding of 
antibody was prevented by preblocking embiyos in TBS/0.1% Tween-20 containing 10% 
heat-inactivated sheep serum for 2.5 hours at room temperature and by pre-incubating anti- 
digoxigenin Fab alkaline-phosphatase conjugate (Boehringo:) in TBS/0.1% Tween-20 
containing heat inactivated 1% sheep serum and qiproximately 0.3% heat inactivated chick 
embryo powder. After an overnight incubation at 4^C with the pre-adsorbed antibody in 
TBS/0.1% Tween-20 containing 1% sheep serum, embryos were washed 3X5 nunutes at 
room temperature with TBS/0.1% Tween-20, 5 X 1.5 hour room temperature washes with 
TBS/1% Twe«-20, and overnight witii TBS/1% Tween-20 at 4*'C. The bufifer was 
exchanged by washing 3 X 10 minutes with NTMT (lOOmM NaCl, 100 mM Tris-HCl, 50 
mM MgC12, 0.1% Tween-20, 2 mM levamisole). The antibody detection reaction was 
performed by incubating embryos with detection solution (NTMT with 0.25 mg/ml NBT and 
0.13 mg/ml X-Phos). In general, pre-lunb bud stage embryos were incubated for 5-15 hours 
and limb bud stage embryos 1-5 hours. After the detection reaction was deemed complete, 
embryos were washed twice with NTMT, once with PBT (pH 5.5), postfixed with 4% 
paraformaldehyde/0.1% gluteraldehyde in PBT, and washed several times with PBT. In 
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some cases embryos were cleared through a series of 30%. 50%, 70%, and 80% glycerol in 
PBT. Whole embryos were photographed under transmitted light using a Nikon zoom stereo 
microscope with Kodak Ektar 100 ASA film. Selected embryos were processed for frozen 
sections by dehydration in 30% sucrose in PBS followed by embedding in gelatin and 
5 freezing. 25 jmi cryostat sections were collected on superfrost plus slides (Fisher), 
rehydrated in PBS, and mounted with gelvatol. Sections were photographed with Nomarski 
optics using a Zeiss Axiophot microscope and Kodak Ektar 25 ASA film. 

(ii) Sequence Homolgy Comparison Between Chicken Sonic hh And Dros-HH And Other 
Vertebrate Sonic hh Proteins 

10 The deduced Sonic amino acid sequence (SEQ ID No:8) is shown and compared to 

the Drosophila hedgehog protein (SEQ ID No:34) in Figure 2. Over the entire open reading 
frame the two proteins are 48% homologous at the amino acids level. The predicted 
Drosophila protein extends 62 aa beyond that of Sonic at its anuno terminus. This N- 
terminal extension precedes the putative signal peptide (residues 1-26) of the fly protein 

15 (SEQ ID No:34), and has been postulated to be removed during processing of flie secreted 
form of Drosophila hedgehog (Lee, J.J. et al., (1992) Cell 71: 33-50). The sequence of 
residues 1-26 of the Sonic protein (SEQ ID No:8) matches weU with consensus sequences for 
eukaryotic signal peptides (Landry, S.J. et al., (1993) Trends. Biochem. Set 16: 159-163) and 
is therefore likely to serve that fimction for Sonic. Furfliermore, Figure 3 shows a hydropaAy 

20 plot (Kyte, J. et al., (1982) J. Mot BioU57: 133-148) indicating that residues 1-26 of the 
Sonic protein (SEQ ID No:8) exhibit a high hydrophobic moment in accord with identified 
eukaryotic signal peptides. Cleavage of the putative signal sequence should occur C-terminal 
to residue 26 according to the predictive method of von Henjie, G. (1986) Nucl. Acid Res. 
1 1 : 1986. A single potraitial N-linked glycosylation site is located at amino acid residue 282 

25 of the Sonic protein (SEQ ID No:8). The predicted Sonic protein does not contain any other 
strong consensus motifs, and is not homologous to any other proteins outside of the 
Hedgehog fdoaiy. 

The mouse (SEQ ID No:l 1) and zebrafish (SEQ ID No:12) homologs of Sonic have 
also been isolated. A comparison of Aese and the Drosophila sequence is shown 
30 schematically in Figure 4. All of the vertebrate proteins have a similar predicted structure: a 
putative signal peptide at their amino terminus, followed by an extraordinarily similar 182 
amino acid region (99% identity in chicken versus mouse and 95% identity in chicken versus 
zebrafish) and a less well conserved carboxy-termirjal region. 

(Hi) At Least Jh-ee Hedgehog Homologues Are Present In The Chicken Genome 

35 Since two distinct PGR products encoding for chidken hedgehogs were amplified 

from genomic DN A, the total number of genes in the chicken hedgehog family needed to be 
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estimated. The two PGR clones pCHA (SEQ ID No:35) and pCHB (SEQ ID No:36) were 
used to probe a genomic Southern blot under moderately stringent conditions as described in 
the above Experimental Procedures. The blot was generated by digesting S ^g of chick 
chromosomal DNA with EcoRI and BamHI alone and together. Each probe reacted most 
5 strongly with a distinct restriction fragment. For example, the blot probed with pCHA» 
shows three bands in each of the Bam HI lanes, one strong at 6.6 kb and two weak at 3.4 and 
2.7 kb. The blot probed with pCHB, shows the 2.7 kb band as the most intense, while the 3 .4 
and 6.6 kb bands are weaker. A similar variation of intensities can also be seen in the Bam 
HI/Eco RI and EcoRI lanes. Exposure times were 72 hr. This data mdicates that each probe 
10 recognizes a distinct chicken hedgehog gene, and that a third as yet uncharacterized chicken 
hedgehog homolog exists in the chicken genome. 

: (iv) Northern Amtysis Defining Sites Cf Sonic Trmscripti^^ 

Northern analysis was performed which confirmed that Sonic is expressed during 
chick development. The spatial and temporal expression of Sonic in the chick embryo from 
IS gastrulation to early organogenesis was determined by whole mount in situ hybridization 
using a riboprobe corresponding to the frill-length Sonic cDNA (SEQ ID No: 1). 

20^g total RNA isolated from stage 24 chick leg buds or bodies (without heads or 
limbs) was fractionated on a 0.8% agarose formaldehyde gel and transferred to a nylon 
membrane (Hybond N» Ameisham). The blot was probed with the 1.6 kb EcoRI insert from 
20 pHH-2. Random-primed a^Zp-labelled insert was hybridized at 42X hybridization buffer 
(1% BSA, SOOmM NaHP04, 7% SDS, 1 mM EDTA, pH 7.2) and washed at 63** C once m 
0.5% bovine serum albumm, 50 mM NaHP04 (pH 7.2), 5% SDS, 1 mM EDTA and once in 
40 mM NaHP04 (pH 7.2), 1% SDS, ImM EDTA. The unage was visualized using a 
phosphoimager (Molecular Dynamics) and photographed directly from the video monitor. 

25 (v) Expression Of Sonic I>iiringMid''Gastrulation 

Sonic message is detected in. the gastrulating blastoderm at early stage 4, the earliest 
stage analyzed. Staining is localized to the anterior end of the primitive streak in a region 
corresponding to Hensen's node. As gastrulation proceeds, the primitive streak elongates to 
its maximal cranial-caudal extent, afrer which Hensen*s node regresses caudally and the 

30 primitive streak shortens. At an early point of node regression. Sonic mRNA can be detected 
at the node and in midline cells anterior to the node. By late stage 5, when the node has 
migrated approximately one-third of the length of the fully elongated primitive streak, 
prominent Sonic expression is seen at the node and in the midline of the embryo, reaching its 
anterior lunit at the developing head process. Sections at a cranial level show that Sonic 

35 mRNA is confined to invaginated axial mesendoderm, tissue which contributes to foregut 
and notochord. More caudally, but still anterior to Hensen's node, staining of axial 
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mesoderm is absent and Sonic expression is confined to the epiblast. At the node itself, high 
levels of Some message are observed in an asymmetric distribution extending to the left of 
and posterior to the primitive pit. This asymmetric distribution is consistently observed (6/6 
embryos from stages 5-7) and is always located to the left of the primitive pit At the node, 
and just posterior to the node, Some expression is restricted to the epiblast and is not 
observed in either mesoderm or endoderm. The expression of Sortie in the dorsal epiblast 
layer without expression in underlymg axial mesoderm contrasts maricedly with later stages 
where Some egression in underlying mesoderm always precedes midline neural tube 
e^qiression. 

) (yi) Expression Of Sonic During Head Fold Stages 

During the formation and differentiation of the head process. Sonic mRNA is detected 
in midline cells of the neural tube, the foregut, and throughout most of the axial mesoderm. 
At stage 7, Sonic message is readily detected asymmetrically at the node and in ventral 
midline cells anterior to the node. The rostral limit of Sonic expression extends to the 
5 anterior-most portions of the embryo where it is expressed in the foregut and prechordal 
mesoderm (Adehnann, H.B., (1932) Am. J. Anat. 31, 55-101). At stage 8, expression of 
Sonic persists along the entire ventral midline anterior to Hensen's node, while the node 
region itself no longer expresses Sonic. Transverse sections at different axial levels reveal 
that at stage 8 Sonic is coexpressed in the notochord and the overlying ventromedial 
20 neuroectoderm from anterior to Hensen's node to the posterior fiareguL ITie levels of Sonic 
message are not uniform m the neural tube: highest levels are found at fte presumptive nud- 
and hindbrain regions with progressively lower levels anterior and posterior. The increasing 
graded expression in the neural tube from Hensen's node to the rostral brain may reflect the 
developmental age of the neuroectoderm as differentiation proceeds from posterior to 
25 anterior. At the anterior-most end of the embryo, expression is observed in midline cells of 
the dorsal and ventral foregut as well as in prechordal mesoderm. Although flw prechordal 
mesoderm is m mtimate contact with the overlying ectoderm, the latter is devoid of &>mic 
expression. 

(vii) Expression Of Sonic During Early CNSDifferenttatton 

30 At stages 1 0 through 14, Sonic expression is detected in the notochord, ventral neural 

tube (including the floor plate), and gut precursors. By stage 10, there is a marked expansion 
of the cephaUc neuroectoderm, giving rise to the fore- mid- and hind-brain. At stage 10, 
Sonic mFNA is abundantly expressed in the ventral midline of the hindbrain and posterior 
midbrain. This expression expands laterally m the anterior midbrain and posterior forebrain. 

35 Expression does not extend to the rostral forebrain at this or later stages. Sections reveal that 
Sonic is expressed m the notochord, the prechordal mesoderm, and the anterior midline of the 
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foregut. Expression in the neuroepithelium extends from the forebrain caudally. In the 
posterior-most regions of the embryo which express Sonic^ staining is found only in the 
notochord and not in the overlying neurectoderm. This contrasts with earlier expression in 
which the posterior domains of Sonic expression contain cells are located in the dorsal 
5 epiblast, but not in underlying mesoderm or endoderm. Midgut precursors at the level of the 
anterior intestinal portal also show weak Sonic expression. 

At stage 14, e}q>ression continues in all three geim layers. The epithelium of the 
closmg midgut expresses Sonic along with portions of the pharyngeal endoderm and anterior 
foregut Ectoderm lateral and posterior to the tail bud also exhibits weak expression. At this 

10 stage, Sonic is also expressed along entire length of the notochord which now extends 
rostrally only to the midbrain region and no longer contacts the neuroepithelium at the 
anterior ecA of the embryo. E:q)ression in head mesenchyme anterior to the notochord is no 
longer observed. In the neural tube Sonic is found along the ventral midline of the fore- mid- 
and hindbrain and posteriorly in the spinal cord. In the forebrain, expression is expanded 

15 laterally relative to the hindbrain. At midgut levels, expression of Sonic in the neural tube 
appears to extend beyond the floor plate into more lateral regions. As observed at stage 10, 
Sonic at stage 14 is found in the notochord, but not in the ventral neural tube in posterior- 
most regions of the embryo. When neuroectodermal expression is first observed in the 
posterior embryo, it is located in midline cells which appear to be in contact with the 

20 notochord. At later stages, expression continues in areas which show expression at stage 14, 
namely the CNS, gut epithelium including the ailaiitoic stalk, and axial mesoderm. 

(viii) Sonic Is Exjn'essed In Posterior Limb Bud Mesenchyme 

The limb buds initially form as local thickenings of the lateral plate mesoderm. As 
distal outgrowth occxirs during stage 17, Sonic expression becomes apparent in posterior 

25 regions of both the forelimb and the hindlimb. Sections through a stage 21 embryo at the 
level of the forelimbs reveal that expression of Sonic in limb buds is limited to mesenchymal 
tissue. A more detailed expression profile of Sonic during limb development is discussed 
below in Example 3. Briefly, as the limb bud grows out, expression of Sonic narrows along 
the anterior-posterior axis to become a thin stripe along the posterior margin closely apposed 

30 to the ectoderm. Expression is not found at more proximal regions of the bud. High levels of 
Sonic expression are maintained until around stage 25/26 ^en staining becomes weaker. 
Expression of Sonic is no longer observed in wing buds or leg buds after stage 28. 
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Mouse Sonic Hedgehog Is Implicated in the Regulation ofCNS and Limb Polarity 
(i) Experimental Procedures 
Isolation Of Hedgehog Phage Clones 

5 The initial screen for mammalian hh genes was performed, as above, using a 700bp 

PGR ftagment encompassing exons 1 and 2 of the Dros-HH gene. Approximately one 
million plaques of a 129/Sv Lambda Fix II genomic libraiy (Stratagene) were hybridized 
with an o 32p^TP labeled probe at low stringency ( 55*C in 6xSSC, 0.5%SDS, 5 x 
Denhardfs; final wash at 60»C in 0.5 x SSC, 0.1% SDS for 20'). Five cross hybridizing 

10 phage plaques corresponding to the Dhh gene were purified. Restriction enzyme analysis 
indicated that all clones were overlapping. Selected restriction enzyme digests were then 
performed to map and subclone one of fliese. Subclones in pGEM (Promega) or Bluescript 
(Stratagene) which cross-hybridized with the Dros-HH fiagment where sequenced using an 
ABI automatic DMA sequmcer. 

15 Mouse Ihh and Shh were identified by low stringency hybridization (as described 

above) with a chick Shh cDNA clone to one million plaques of an 8.5 day XgtlO mouse 
embryo cDNA Ubraiy (Fahmer, K. et al.. (1987) EMBO J. 6: 1265-1271). Phage plaques 
containing a 1.8kb Ihh and 0.64 and 2.8kb Shh inserts were identified. Inserts were excised 
and subcloned into Bluescript (Stratagene) for dideoxy chain termination sequencing using 

20 modified 17 DNA polymerase (USB). The larger Shh clone contamed a partially processed 
cDNA in which intron splicing at Ae ocon 1/2 junction had not occurred. 

To screen for additional Ihh and Shh cDNA clones, an 8.5 day XZAPII cDNA library 
was probed at high stringency (at 65»C in 6xSSC, 0.5% SDS, 5 x Denhardt's; final wash at 65 
"C in 0.bcSSC, 0.1% SDS for 30") wifli ibe Ihh and Shh mouse cDNA clones. No additional 
25 Ihh clones were identified. However several 2.6Vb, j^jpaiently fiill lengfli, Shh clones were 
isolated. The DNA sequence of Hts additional 5' coding region not present in Has original 
0.64 and 2.8kb Shh clones was obtained by analysis of one of the 2.6kb inserts. 

Northern Blot Analysis 

Expression of S!Wr was investigated by RNA blot analysis using 20 \xg of total RN A 
30 ftom adult bram, spleen, kidney, liver, lung, 16.5dpc brain, liver and lung; 9.5dpc to 17.5dpc 
whole embryo; 9.5dpc forebrain, midbrain and lO.Sdpc brain. RNA samples were 
electrophoretically separated on a 1.2% agarose gel, transferred and u.v. crosslinked to 
Genescreen (DuPont) and probed with 2X1 0^ cpm/ml of an a32p^TP labeled mouse Shh 
probe (2.8kb insert fcom Xgt 10 screen). Hybridization was performed at 42''C in 50% 
35 formamide 5x Denhardt's, 5xSSPE, 0.1%SDS, 6.5% dextran, 200jig/ml sahnon sperm DNA. 
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Final wash was at SS^'C in O.lxSSC, 0.1%SDS. The blot was exposed for 6 days in the 
presence of an intensifying screen. 

In Situ Hybridization, ^-Galactosidase Staining And Histological Analysis 

Embryos from 7.25 to 14.5dpc were analyzed for either Shh or HNF-3P expression by 
5 whole mount in situ hybridization to digoxygenin labeled RNA probes as described in 
Wilkinson, (1992) In situ Hybridization: A Practical Approach Oxford; Parr et al., (1993) 
Development 119:247-261. The mouse Shh probe was either a 2.8kb or 0.6kb RNA 
transcript generated by T7 (2.8kb) or T3 (0.6kb) transcription of Xbal and HindUI digests of 
Bluescript (Stratagene) subclones of the original Shh cDNA inserts. The HNF-3p probe was 
10 generated by HindUI linearization of a HNF-3P cDNA clone (Sasaki, H. et al., (1993) 
Development 118: 47-59) and T7 polymerase transcription of 1.6kb transcript. Embryos 
were photographed on an Olympus-SZH photomicroscope using Kodak Ektachrome EPY 
64T color slide film. 

Sections through wild type and WEXPl-CShh transgenic embryos were prepared and 
15 hybridized with 35s.UIP labeled RNA probes (Wilkinson, D.G. et al., (1987) Development 
99: 493-500). Sections were photographed as described in McMahon, AJP. et al., (1 992) Cell 
69: 581-595. 

P Staining of WEXP2-lacZ embryos with pwas performed according to Whiting, J. et 
al., (1991) Genes & Dev. 5: 2048-2059. General histological analysis of wildtype and 
20 WEXP2-CShh transgenic embryos was performed on parafBn sections of Bouin*s fixed 
embiyos counterstained with hematoxylin and eosin. Histological procedures were as 
described by Kaufinan, M.H. (1992) The Atlas of Mouse Development, London: Academic 
Press. Sections were photographed on a Leitz Aristoplan compound microscope iising 
Kodak EPY 64T color sUde fikn. 

25 DNA Constructs For Transgenics 

Genomic Wnt-^l fragments werb obtained by screening a XGEM12 (Promega) 129/Sv 
mouse genomic library with a 375 bp Mlul-Bglil fragment derived fit>m the fourth exon of 
the murine Wnt-l gene. One of the clones (Wl-15.1) was used in this study. 

As initial step towards the generation of the pWEXP2 expression vector, Wl-15.1 
30 was digested to completion with restriction en2ymes AatU arid C/oI, and a 2774 bp AatU" 
Clal firagment isolated. This fragment was ligated into ^^11 and Clal cut pGEM-7Zf vector 
(Promega), generating pWl-18. This plasmid was digested with Hindu and ligated to 
annealed oligonucleotides lad (SEQ ID No:21) and lac2 (SEQ ID No:22) generating pWl- 
ISS'*' which has a modified polylinker downstream of the Clal restriction site. This constmct 
35 (pWl-18S*) was digested with Clal and Bg/II and ligated with both the 2.5 kb 3* Clal - Bgm 
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exon-intron region and 5.5 kb 3' Bgfll -Bg/II W/j/-i enhancer, generating pWRES4. This 
construct contains a 10.5 kb genomic region which starts upstream of the Wnt-1 translation 
initiation codon (at an Aatn site approximately 1.0kb fiom the ATG) and extends to a BgV\ 
site 5.5 kb downstream of the Wnt-l polyadenylation signal. This plasmid also contains a 250 
bp region of the neomycin phosphotransferase (neo) gene inserted in inverse orientation in 
the 3' transcribed but untranslated region. Finally, to generate the WEXP2 expression vector, 
a 2 kb I fiagment was amplified fiwm pWRES4 using Sf-1 (SEQ ID No:23) and Sf-2 
(SEQ ID No:24) oligonucleotides. This amplified firagment was digested with Sf\ I and 
inserted into Sfi I linearized pWRES4, generating pWEXP2. This destroys the Wnt-l 
translation initiation codon, and replaces it by a polylinker containing Uru I, Eco RV, Sac II, 
and Est BI restriction sites, which are unique in pWEXP2. 

The WEXP2 - lacZ construct was obtained by inserting an oid-filled Bgl II - Xho I 
/acZ fiagment isolated firom the pSDIUacZpA vector in the IJru I cut pWEXP2 e3q>ression 
vector. Similarly, the WEXP2 - QSMfi construct was obtained 1^ inserting an end-filled XbO. 
cDNA fragment containing the full Chick Shh coding sequence (SEQ ID No:l) into the JVrw I 
cut WEXP2 ejqiression vector. 

Oligonucleotide sequences are as follows: 

lacl: 5'-AGCTGTCGACGCGGCCGCTACGTAGGTTACCGACGTCAAGCTTAGATCTC-3' 
lac2: 5'-AGCroAGATCTAAGCTTGACGTCGGTAACCTACGTAGCGGCCGCGTCGAC-3' 
Sf-1 : 5'-GATCGGCCAGGCAGGCCTCGCX3ATATCGTCACCGCGGTATTCGAA-3' 
Sf-2: 5'-AGTGCCAGTCGGGGCCCCCAGGGCCGCGCC-3' 

Production And Genotyping Of Transgenic Embryos 

Transgenic mouse embryos were generated by microinjection of linear DNA 
fragments into the male pronucleus of B6CBAF1/J (C57BL/6J X CBA/J) zygotes. CD-I or 
B6CBAF1/J females were used as recipients for injected embryos. mice embryos were 
collected at 9.5, 10.5, and 11.5 dpc, photographed usmg an Olympus SZH stereophoto- 
microscope on Kodak EPY-64T color slide fihn, then processed as desoibed earlier. 

WEXP2-/acZ and WEXP2-CS'Wi transgenic embryos were identified by PCR analysis 
of proteinase-K digests of yolk sacs. Briefly, yolk sacs were carefiilly dissected firee firon 
maternal and iembryonic tissues, avoiding cross-contamination between littermates, Haea 
washed once in PBS. After ovonight incubation at 55*'C in 50 |xl of PCR proteinase-K 
digestion buffer (McMahon, A.P. et aL, (1990) Cell 62: 1073-1085). 1 ^ll of heat-inactivated 
digest was subjected to polymerase chain reaction (PCR) in a 20 ^1 volume for 40 cycles as 
follows: 94'*C for 30 seconds, 55*C for 30 seconds, 72*C for 1 minute, with the reaction 
ingredients described previously (McMahon, A.P. et al., (1990) Cell 62: 1073-1085)). In the 
case of the WECP2 - lacZ transgenic embryos, oligonucleotides 137 (SEQ ID No-25) and 
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138 (SEQ ID No:26) amplify a 352 bp lacZ specific product. In the case of the WEXP2- 
CShh embryos, oligonucleotides WPR2 -specific) (SEQ ID No:27) and 924 (Chick 
iSfeA-specific) (SEQ ID No:28) amplify a 345 bp fragment spanning the insertion junction of 
the Chxck-Shh cDNA in the WEXP2 expression vector. Table 2 summarizes the results of 
5 WEXP2-C-5M transgenic studies, 

Oligonucleotide sequences are as follows: 
137: 5'-TACCACAGC.GGATGGTTCGG-3' 
138: 5* -GTGGTGGTtATC}CCGATCGC-3' 
WPR2: 5*-TAAGAGGCCTATAAGAGGCGG-3* 
10 924: 5'-AAGTCAGCCCAGAGGAGACT-3' 

(ii) Mouse hh Genes 

The combined screening of mouse genomic and 8.5 day post coitum (dpc) cDNA 
libraries identified three mammalian hh counterparts (Figure SA) which herein will be 
referred to as Desert, Indian and Sonic hedgehog (Dhh, Ihh and Shh, respectively). 

IS Sequences encoding Dhh (SEQ ID No:2) were determined from analysis of clones identified 
by low stringency screening of a mouse genomic library. DNA sequencing of one of five 
overlapping lambda phage clones identified three homologous regions encoding a single 
open reading fi^e interrupted by introns in identical position to those of the Dros-HH gene 
(Figure 5A). SpUcing across the exon 1/2 boundary was confirmed by polymerase chain 

20 reaction (PGR) amplification of first strand cDNA generated fiom adult testicular RNA. The 
partial sequwice of Ihh (SEQ ID No:3) and the complete sequence of Shh (SEQ ID No:4) 
coding regions were determined from the analysis of overl25)ping cDNA clones isolated from 
8.5 dpc cDNA libraries. The longest Shh clone, 2.6kb, appears to be full length v^^en 
compared with the Shh transcript present in embryonic RNAs. The 1.8kb partial length Ihh 

25 cDNA is complete at the 3' end, as evidenced by the presence of a polyadenylation consensus 
sequence and short poly A tail. 

Alignment of the predicted Dios-HH protein sequence (SEQ ID No:34) with those of 
the mouse Dhh (SEQ ID No:9), Ihh (SEQ ID NorlO) and Shh (SEQ ID No:l 1), and chick Shh 
(SEQ ID No:8) and zebrafish Shh (SEQ ID No: 12), reveals several interesting features of the 

30 AA-family (Figure 5A). All the vertebrate /1/2-proteins contain an amino terminal hydrophobic 
region of approxinwttely 20 amino acids immediately downstream of the initiation 
methionine. Althoi^ the properties of these new hh proteins have not been investigated, it 
is likely that this region constitutes a signal peptide and vertebrate hhs are secreted proteins. 
Signal peptide cleavage is predicted to occur (von Heijne, G., (1986) Nucleic Acids Research 

35 14: 4683-4690) just before an absolutely conserved six amino acid stretch, CGPGRG (SEQ 
ID No:29) (corresponding to residues 85-90)(Figure 5A), in all hh proteins. This generates 
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processed mouse Dhh (SEQ ID No:9) and Shh (SEQ ID No:ll) proteins of 41 and 44 kd, 
respectively. Interestingly, Dros-HH (SEQ ID No:34) is predicted to contain a substantial 
amino terminal extension beyond the hydrophobic domain suggesting that the Drosophila 
protein enters the secretory pathway by a type II secretory mechanism. This would generate 

5 a transmembrane tetiiered protein which would require subsequent cleavage to release a 43 
kd secreted form of the protein. In vitro analysis of Dros-HH is consistent with this 
interpretation (Lee, J.J. et al., (1992) Cell 71: 33-50), However, there also appears to be 
transitional initiation at a second methionine (position 51 of SEQ ID No:34) just upstream of 
tiie hydrophobic region (Lee, J J. et al., (1992) Cell 71 : 33-50), suggesting tiiat Dros-HH, like 

10 its vertebrate counterparts, may also be secreted by recognition of a conventional amino 
temunal signal peptide sequence. 

Data base searches for protein sequences related to vertebrate hh's failed to identify 
any significant homologies, excepting Dros-HH. In addition, searching the TROSITE" data 
bank of protein motifs did not reveal any peptide motifs which are conserved in the different 
15 hh proteins. Thus, the hhs represent a novel family of putative cell signaling molecules. 

One feature of the amino acid alignment is the high conservation of hh sequences. 
Vertebrate hhs share 47 to 51% amino acid identity witib Dros-HH throughout the predicted 
processed polypeptide sequmce (Figure 6). Dhh has a slightly higher identity than that of 
Ihh and Shh suggesting that Dhh may be the orthologue of Dros*HH. Conservation is hig)iest 

20 in tiie amino terminal half of the protems, indeed, firom position 85 (immediately after the 
predicted shared cleavage site) to 249, 62% of the amino acids are completely mvariant 
amongst the Drosophila and vertebrate proteins. Comparison of mouse Dhh, Ihh and Shh 
where their sequences ov^lap in this more conserved r^on, indicates that Ihh and Shh are 
more closely related (90% amino acid identity; residues 85 to 266) than with the Dhh 

25 sequence (80% ammo acid identity; residues 85 to 266). Thus, Ihh and Shh presumably 
resulted from a more recent gene duplication event 

Comparison of cross species . identity amongst Shh proteins reveals an eveii more 
striking sequence conservation. Throughout tiie entire predicted processed sequence mouse 
and chick Shh share 84% of amino acid residues (Figure 6). However, in the amino temunal 

30 half (positions 85 to 266) mouse and chick are 99% and mouse and zebrafish 94% identical 
in an 180 amino acid stretoh. Conservation Ms ofif rapidly after position 266 (Figure 5 A). 
SEQ ID No:40 shows the consensus sequence in the amino termmal half of all vertebrate Shh 
genes (human, niouse, chicken and zebrafish) identified to date. SEQ ID No:41 shows the 
consensus sequence in the ammo tenninal half of vertebrate hedgehog genes (Shh, Ihh, and 

35 Dhh) identified to date in different species (mouse, chicken, human and zebrafish). 
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In summaiy, hh family members are likely secreted proteins consisting of a highly 
conserved amino terminal and more divergent carboxyl terminal halves. The extreme 
interspecies conservation of the vertebrate Shh protein points to likely conservation of Shh 
function across vertebrate species. 

5 (Hi) Expression of Mouse Shh at the Axial Midline 

E5q>ression of Shh in the mouse was eicamined in order to explore the role of mouse 
Shh (SEQ ID No:l 1) in vertebrate development Northern blots of embryonic and adult UNA 
samples were probed with a radiolabelled mouse Shh cDNA probe. An Shh transcript of 
approxunately 2.6kb was detected in 9.5dpc whole embryo RNA, and 9.5 and lO.Sdpc brain 
10 RNA fractions. No expression was detected in total RNA samples from later embryonic 
stages. Of the late fetal and adult tissue RNAs examined Shh exptession was only detected in 
16.5dpc and adult lung. 

To better define the precise temporal and spatial exjmssion of Shh an extensive series 
of whole mount and serial section in situ hybridizations were performed using digoxygenin 

IS and ^Ss-radiolabelled RNA probes, respectively, to mouse embryo sanq>les from 7.25dpc 
(mid streak egg (^linder stage of gastrulation) to 13.Sdpc. 'Ho Shh expression is detected at 
mid-gastrulation stages (72Sdpc) prior to the CQ>pearance of the node, the mouse counterpart 
of the amphibian organizer and chick Hensen's node. When the primitive streak is fully 
extended and the midline mesoderm of the head process is emerging from the node (7.5 to 

20 7.75dpc), Shh is expressed exclusively in the head process. At late head fold stages, Shh is 
expressed in the node and midline mesoderm of the head process extending anteriorly under 
the presumptive brain. Just prior to somite formation, Shh exterids to the anterior limit of the 
midline mesoderm, underlying the presumptive midbrain. As somites are formed, the 
embryonic axis extends caudally. The notochord, which represents the caudal extension of 

25 the head process, also e}q>resses Shh^ and expression is maintained in the node. 

Interestingly, by 8 somites (8.5dpc) strong Shh expression appears in the CNS. 
Expression is initiated at the ventral midline of the midbrain, above the rostral limit of the 
head process. By 10 somites CNS ej^ressioh in the midline extends rostrally in the forebrain 
and caudally into the hindbrain and rostral spinal cord. Expression is restricted in the 

30 hindbrain to the presumptive floorplate, whereas midbrain esqiression extends ventro- 
laterally. In the forebrain, there is no morphological floor plate, however ventral Shh 
expression here is continuous with the midbrain. By 15 somites ventral CNS expression is 
continuous from the rostral limit of the diencephalon to the presiunptive spinal cord in 
somitic regions. Over the next 18 to 24 hrs, to the 25-29 somite stage, CNS expression 

35 intensifies and forebrain expression extends rostral to the optic stalks. In contrast to all other 
CNS regions, in the rostral half of the diencephalon, Shh is not expressed at the ventral 
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midline but in two strips immediately lateral to this area which merge again in the floor of the 
forebrain at its rostral limit. Expression of Shh in both the notochord and floorplate is 
retained until at least 13.5dpc. 

Several gcovps have recently reported the cloning and e3q)ression of vertebrate 
5 members of a family of transcription factors, related to the Drosophila forkhead gene. One 
of these, HNF-3p shows sev^ similarities in expression to Shh (Sasaki, H. et al., (1993) 
Development 118: 47*59) suggesting that may be a potential regulator of Shh. To 

investigate this possibility, direct comparison of HNF-3P and Shh expression was 
imdertaken. HNF-Sfi transcripts are first detected in the node (as previously reported by 

10 Sasaki, H. et al*, (1993) supra\ prior to the emergence of the head process and before Shh is 
expressed. From the node, expression proceeds anteriorly in the head process, similar to Shh 
^ression. Activation of HNF-Sfi withm the C!NS is first observed at 2-3 somites, in the 
presumptive mid and hindbrain, prior to the onset of Shh expression. By 5 sontdtes, 
expression in the midbrain broadens ventro-laterally, extends anteriorly into the forebrain and 

IS caudally in the presumptive floor plate down much of the neuraxis in the somitic region. 
Strong expression is maintained at this time in the node and notochord. However, by 10 
somites expression in the head process is lost and by 25-29 somites notochordal oq^ression is 
. only present in the most extreme caudal notochord. In contrast to the transient e?qpression of 
HNF'3^ in the midline mesoderm, expression in the floor plate is stably retained until at least 

20 1 1 .5dpc. Thus, there are several spatial similarities between the expression of HNF-S^ and 
Shh in both the midline mesoderm and ventral CNS and it is likely that both genes are 
expressed in the same cells. However, in both regions, HNF-3^ expression precedes that of 
Shh. The main differences are in the transient expression of HNF-3^ in the head process and 
notochord and Shh expression in the forebrain. Whereas HNF'3^ and Shh share a similar 

25 broad ventral and ventral lateral midbrain and caudal diencephalic expression, only Shh 
extends more rostrally into the forebrain. In general, these results are consistent with a model 
in which initial activation of Shh expression may be regulated by HNF-3p. 

The similarity in Shh and HNF'3^ expression domains is also apparent in the 
definitive endoderm which also lies at the midline. Broad HNF-3^ expression in the foregut 

30 pocket is apparent at 5 somites as previously reported by Sasaki, H. et al., (1993) supra. Shh 
is also e)^ressed in the endoderm, immediately beneath the forebrain. Both genes are active 
in the rostral and caudal endoderm from 8 to 1 1 somites. Whereas HNF-3^ is uniformly 
expressed, Shh expression is mitially restricted to two ventro-lateral strips of cells. Ventral 
restricted expression of Shh is retained in the most caudal region of the presumptive gut until 

35 at least 9.5dpc whereas HNF'3^ is imiformly xpressed along the dorso-ventral axis. Both 
genes are expressed in the pharyngeal ectoderm at 9.5dpc and expression is maintained in the 
gut imtil at least 11.5dpc. Moreover, e)q>ression of Shh in the embryonic and adult lung RNA 
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suggests that endodennal expression of Shh may continue in, at least some endodemi derived 
organs. 

(iv) Expression Of Shh In The Limb 

Expression of Shh is not confined to midline structures. By 30-35 somites (9.75dpc), 
S expression is detected in a small group of posterior cells in the forelimb bud. The forelimb 
buds form as mesenchymal outpocketings on the flanks, opposite somites 8 to 12, at 
approximately the 17 to 20 somite stage. Shh expression is not detectable in the forelimbs 
until about 30-35 somites, over 12 hours after the initial appearance of the limbs. Expression 
is exclusively posterior and restricted to mesenchymal cells. By 10.5dpc, both the fore and 

10 hindlimbs have elongated substantially from the body flank. At this time Shh is strongly 
expressed in the posterior, distal aspect of both limbs in close association with the overlying 
ectoderm. Analysis of sections at this stage detects Shh expression in an approximately six 
cell wide strip of posterior mesenchymal cells. In the forelimb, Shh expression ceases by 
ll.Sdpc. However, posterior, distal expression is still detected in the hindlimb. No limb 

1 5 expression is detected beyond 12.5dpc. 

(v) Ectopic Expression Cf Shh 

Grafting studies carried out principally in the chick demonstrate that cell signals 
derived from the notochord and floor plate pattern the ventral aspect of the CNS (as described 
above). In fhe limb, a transient signal produced by a group of posterior cells in both limb 
20 buds, the zone of polarizing activity (ZPA), is thought to regulate patterning across the 
anterior-posterior axis. Thus, the sequence of Shh^ which predicts a secreted protein and the 
expression profile in midline mesoderm, the floor plate and in the limb, suggest that Shh 
signaling may mediate pattern regulation in the vmtral CNS and limb. 

To determine whether Shh may regulate ventral development in the early manunalian 
25 CNS, a Wnt'l enhancer was used to alter its nomfial domain of expression. Wnt-l shows a 
dynamic pattern of expression which is initiated in the presumptive midbrain just prior to 
somite formation. As the neural folds elevate and fuse to enclose the neural tube, Wnt-J 
expression in the midbrain becomes restricted to a tight circle, just anterior of the midbrain, 
the ventral midbrain and the dorsal midline of die diencephalon, midbrain, myelencephalon 
\ 30 and spinal cord (Wilkinson, D.G. et al., (1987) Cell 50: 79-88; McMahon, A.P. et al., (1992) 
Cell 69: 581-595; Parr, B.A. et al., (1993) Development 1 19: 247-261). 

• It was determined that essentially normal expression of lacZ reporter constructs 

within the Wnt-l expression domain is dependent upon a 5.5kb enhancer region which lies 
downstream of the Wnhl polyadenylation sequence. A constmct was generated for ectopic 
35 expression of cDNA clones in the Wnt-l domain and tested in transgenics using a lacZ 
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reporter (pWEXP-lacZ; Figure 9). Two of the four Gq transgenic embryos showed readily 
detectable p-galactosidase activity, and in both e}q)ression occurred throughout tiie normal 
WntA expression domain. More extensive studies with a similar construct also containing the 
S.Skb enhancer gave similar firequencies. Some ectopic expression was seen in newly 
5 emerging neural crest cells, probably as a result of perdurance of |J-galactosidase RNA or 
piotein in the dorsally derived cresL Thus, the WntA expression construct allows the efficient 
ectopic expression of cDNA sequences in the midbrain and in the dorsal a^ect of much of 
theCNS. 

An SWi ectopic e^qsression construct (pWEXP*CSb/i) containing two tandem head to 
10 tail copies of a chick Shh cDNA vras generated (Figure 7). By utilizing this approach, ectopic 
expression of the chick Shh is distinguishable fix>m that of the endogenous mouse Shh gene. 
Chick shows a high degree of sequence identity and similar expression to the mouse 
gene. Thus, it is highly likely that function is widely conserved amoi^st vertebrates, a 
conclusion further supported by studies of the same gene in zebrafish. 

15 Table 2 shows the results of several transgenic experiments in which the Gq 

population was collected at 9.5 to 11.5dpc. Approximately half of the transgenic embryos 
identified at each stage of development had a clear, consistent CNS phenotype. As we 
expect, on the basis of control studies using the 5.5kb WntA enhancer, that only half the 
transgenics will express the transgene, it is clear that in most embryos ectopically expressing 

20 chick Shh^ an abnormal phenotype results. 



TABLE 2 

Summary of WEXP2-Chick Shh transgenic studies 



Age (dpc) 


Number of 
Embryos 


Number of 
Transgenics 


Number of Embryos with 
CNS phenotype^ 


9.5 


37 


11 


6(54.5%) 




59 


16 


8 (50%) 


\\5 


33 


7 


3 (42.9%) 



25 Figures in parentheses, refer to the percentage of transgenic embryos with a CNS phenotype 

^ In addition one 9.Spc and two IO.Spc transgenic embryos showed non-specific growA retardation, 
as occurs at low frequency in transgenic studies. These embiyos were excluded from iiirdier analysis* 

At 9.5dpc, embryos vsdth a weaker ph notype show an open neural plate fiom the mid 
diencephalon to the myelencephalon. In embryos with a stronger phenotype at the same 
30 stage, the enture diencq>halon is open and telencephalic and optic development is 
morphologically abnomial. As the most anterior diencephalic expression of WntA is lower 
than that in more caudal regions, the differences in severity may relate to differences in the 
level of chick &h expression in different Gq embryos. At &e lateral margins of the open 
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neural folds, where Wnt-l is normally expressed, there is a thickening of the neural tissue 
extending from the diencephalon to myelencq)halon. The cranial phenotype is similar at 
10«5 and 11. S dpc. However, there appears to be a retardation in cranial expansion of the 
CNS at later stages. 

5 In addition to the dorsal cranial phenotype, there is a progressive dorsal phenotype in 

the spinal cord. At 9.5 dpc, the spmal cord appears morphologically normal, except at 
extreme rostral levels. However by lO.Sdpc, there is a dorsal dysmoiphology extending to the 
fore or hindlimbs. By 11.5 dpc, all transgenic emfaiyos showed a dorsal phenotype along 
almost the ehture spinal cord. Superficially, the spinal cord had a rippled, undulating 

10 appearance suggestive of a change in cell properties dorsally. This dorsal phenotype, and the 
cranial phenotype were examined by histological analysis of transgenic embryos. 

Sections through a 9.5dpc embryo with an extreme CNS phenotype show a 
widespread dorsal perturbation in cranial CNS development. The neural/ectodermal junction 
in the diencephalon is abnormal. Neural tissue, which has a columnar epithelial morphology 

15 quite distinct from the squamous epithelium of the surface ectoderm, appears to spread 
dorsolaterally. The myelencephalon, like the diencephalon and midbrain, is open rostrally. 
Interestingly, there are discontinuous dorso-lateral regions in the myelencephalon with a 
morphology distinct from the normal roof plate regions close to the normal site of Wnt-l 
expression. These cells form a tight, polarized epithelium with basely located nuclei, a 

20 morphology similar to the floor plate and distinct from other CNS regions. Differentiation of 
dorsally derived neural crest occurs in transgenic embryos as can be seen from the presence 
of cranial ganglia. In the rostral spinal cord, the neural tube appeared distended dorso- 
laterally which may account for the superficial dysmorphology. 

By 1 l.Sdpc, CNS development is highly abnormal along the entire dorsal spinal cord 
to the hindlimb level. The dorsal half of the spinal cord is enlarged and distended. Dorsal 
sensory innervation occurs, however, the neuronal trajectories are highly disorganized. Most 
obviously, the morphology of dorsal cells in the spinal cord, which normally are elongated 
cells with distinct lightly staining nuclei and cytoplasm, is dramatically altered. Most of the 
dorsal half of the spinal cord consists of small tightly packed cells with darkly staining nuclei 
and little cytoplasm. Moreover, there appears to be many more of these densely packed cells, 
leading to abnormal outgrowth of the dorsal CNS. In contrast, ventral development is 
normal, as are dorsal root ganglia, whose origins lie m neural ceils derived from the dorsal 
spinal cord. 

(vi) Ectopic Shh Expression Activates Floor Plate Gene Expression 

35 To determine whether ectopic expression of chick Shh results in inappropriate 

activation of a ventral midline development in the dorsal CNS, expression of two floor plate 
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expressed genes, HNF-3P and mouse Shh, were examined. Whole mounts of 9.5dpc 
transgenic embryos show ectopic expression of HNF-3p throughout the cranial Wnt-l 
ocpression domain. In addition to normal expression at the ventral midline, HNF-3p 
transcripts are ejqjressed at high levels, in a circle just rostral to the mid/hindbram junction, 
along the dorsal (actually lateral in unfiised brain folds) aspects of the midbram and, more 
weakly, in Ae roof plate of the myelencephalon. No expression is observed in the 
metencephalon which does not express Wnt-l. Thus, ectopic expression of Shh leads to the 
activation of HNF-SP throughout the cranial Wnt-l expression domain. 

The relationship between chick Shh expression and the expression of HNF-Sp in 
serial sections was also examined. Activation of HNF-3P in the brain at 9.5 and lO.Sdpc is 
localized to the dorsal aspect in good agreement with the observed ectopic expression of 
chick SWi. Interestingly mouse SfeA is also activated dorsally. Thus, two eariy floor plate 
markers are induced in reqwnse to chick Shh. 

From 9.5dpc to 11.5dpc, the spinal cord phenotype becomes more severe. The 
possibility that activation of a floor plate pathway may play a role in the observed phenotype 
was investigated. In contrast to the brain, where ectopic HNF-33 and Shh transcripts are still 
present, little or no induction of these floor plate maricers is observed Thus, although the 
dorsal spinal cord shows a wideq)read transformadon in cellular phenoQrpe, this does not 
£qppear to result fiom the induction of floor plate development 

Cfack Sonic Hedgehog Mediates ZPA Activity 
(i) Eiqjerimental Procedures 
^tinoic Acid Bead Implants 

Fertilized white Leghorn chicken eggs were incubated to stage 20 and then implanted 
with AG1-X2 ion exchange beads (Biorad) soaked in 1 mg/ml retinoic acid (RA, Sigma) as 
described by Tickle, C. et al., (1985) Dev. Biol 109: 82-95. Briefly, the beads were soaked 
for 15 min in Img/ml RA in DMSO, washed twice and implanted under the AER on the 
anterior margin of the limb bud. After 24 or 36 hours, some of the in^lanted embryos were 
harvested and fixed overnight in 4% paraformaldehyde in PBS and then processed for whole 
mount in situ analysis as previously described. The remainder of the animals were allowed to 
develop to embryonic day 10 to confirm that the dose of RA used was c^le of inducing 
mirror unage duplications. C:ontrol animals were implanted with DMSO soaked beads and 
showed no abnormal phenotype or gene expression. 



wo 95/18856 



PCT/USSMyi4992 



Plasmids . ^7 

Unless otherwise noted, all standard cloning techniques were performed according to 
Ausubel, F.M. et al., (1989) Current Protocols in Molecular Biology (N.Y.: Greene 
Publishing Assoc. and Wiley Inerscience), and all enzymes were obtamed from Boehringer 
S Mannheim Biochemicals. pHH-2 is a cDNA contain the entire coding region of chicken 
Sonic hedgehog (SEQ ID No:l). RCASBP(A) and RCASBP(E) are replication-competent 
retroviral vectors which encode viruses with differing host ranges. RCANBP(A) is a variant 
of RCASBP(A) from viuch fte second splice acceptor has been removed. This results in a 
virus which can not express tiie inserted gene and acts as a control for the effects of viral 

10 infection (Hughes, S.H. et al., (1987) J. Virol 61: 3004-3012; Fekete, D. et al., (1993) Mol 
Cell Biol 13: 2604-2613). RCASBP/AP(E) is version of RCASBP(E) contaming a human 
placental alkaline phosphatase cDNA (Fekete, D. et al., (1993b) Proc. Natl Acad. Set USA 
90: 2350-2354). SLAX13 is a pBluescript SK+ derived plasmid with a second Cla I 
restriction site and the 5' untranslated region of v-src (from the adaptor plasmid CLA12-Nco, 

15 Hughes, S JH. et al., (1987) J, Virol 61: 3004-3012) cloned 5* of the EcoRI (and Clal) site in 
the pBluescript polylinker. RCASBP plasmids encoding Sonic from either the first (Ml) or 
second (M2) methionine (at position 4) were constructed by first shuttling the 1.7kb Sonic 
fragment of pHH-2 into SLAX-13 using oligonucleotides to modify the 5' end of the cDNA 
such that either the first or second methionine is in frame with the Ncol site of SLAX-13. 

20 The amino acid sequence of Sonic is not mutated in these constructs. The Ml and M2 Sonic 
Clal fragments (v-src 5^\JYK:Sonic) were each then subcloned into RCASBP(A), 
RCANBP(A) and RCASBP(E), generating Sonic/RCAS-Al, Sonic/RCAS-A2, 
Sonic/RCAN-Al, Somc/RCAN-A2, Sonic/RCAS-El and Sonic/RCAS-E2. 

Chick Embryos, Cell Lines And Virus Production 

25 All experimental manipulations were performed on standard specific-pathogen free 

Wliite Leghorn chick embryos (S-SPF) from closed flocks provided fertilized by SPAFAS 
(Norwich, Conn). Eggs were incubated at 37.5°C and staged according to Hamburger, V, et 
al., (1951) 1 Exp. Morpk 88: 49-92. All chick embryo fibroblasts (CEF) were provided by 
C. Cepko. S-SPF embryos and CEFs have previously been shown to be susceptible to 

30 RCASBP(A) infection but resistant to RCASBP(E) farfection (Fekete, D. et al,, (1993b) Proc. 
Natl Acad Set USA 90: 2350-2354). Line 15b CEFs are susceptible to infection by both 
RCASBP(A) and (E). These viral host ranges were confirmed in control experiments. CEF 
cultures were grown and transfected with retroviral vector DNA as described (Morgan, B.A. 
et al., (1993) Nature 358: 236-239; Fekete, D. et al„ (1993b) Proc. Natl Acad Sci USA 90: 

35 2350-2354). All viruses were harvested and concentrated as previously described (Morgan, 
B.A. et al., (1993) Nature 358: 236-239; Fekete, D. et al., (1993b) Proc. Natl Acad. Sci. USA 
90: 2350-2354) and had titers of approximately 10^ cfii/ml. 
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Cell Implants 

A single 60min dish containing line 15b CEFs which had been infected with either 
RCASBP/AP(E), Sonic/RCAS-El or Sonic/RCAS-E2 were grown to 50-90% confluence, 
lightly trypsinized and then spun at 1000 rpm for 5 min in a clinical centrifuge. The pellet 

5 was resuspended in 1 ml media, transferred to a microcentrifuge tube and then 
microcentrifiiged for 2 min at 2000 ipm, Followmg a 30 mm incubation at 37** C, the pellet 
was respun for 2 min at 2000 rpm and then lightly stained in media containing 0.01% nile 
blue sul&le* Pellet fi:agments of approximately 306^ixi x lOOfmi x 50^m were implanted as 
a wedge to the anterior region of hh stage 19-23 wing buds (as described by Riley, B.B. et al., 

10 (1993) Development 118: 95-104). At embryonic day 10, the embryos were harvested, fixed 
in 4% paraformaldehyde in PBS, stained with alcian green, and cleared in methyl salicylate 
(Tickle, C, et al., (1985) Dev. Biol 109: 82-95)- 

Viral Infections 

Concentrated Sonic/RCAS-A2 or Sonic/RCAN-A2 was injected under the AER on 
15 the anterior margin of stage 20-22 wing buds. At 24 or 36 hours post-infection, the embryos 
were harvested, fixed in 4% paraformaldehyde m PBS and processed for whole mount in situ 
analysis as previously described. 

(ii) Co-Localization Of Sonic Expression And Zpa Activity 

ZPA activity has been carefully mapped both spatially and temporally within the limb 
20 bud (Honig, L.S. et al., (1985) J. Embryol exp. Morph 87: 163-174). In these experiments 
small blocks of limb bud tissue firom various locations and stages of chick embryogenesis 
(Hamburger, V et al., (1951) 1 Exp, Morph 88: 49-92) were grafted to the anterior of host 
limb buds and the strength of ZPA activity was quantified according to degrees of digit 
duplication. Activity is first weakly detected along the flank prior to limb bud outgrowth. 
25 Tlie activity first reaches a maximal strength at stage 19 in the proximal posterior margin of 
the limb bud. By stage 23 the activity extends the full length of the posterior border of the 
limb bud. The activity then shifts distally along the posterior margin so that by stage 25 it is 
no longer detectable at the base of the flank. The activity then fades distally until it is last 
detected at stage 29. 

30 This detailed map of endogenous polarizing activity provided the opportunity to 

determine the extent of the correlation between the spatial pattern of ZPA activity and Sonic 
expression over a range of developmental stages. Whole mount in situ hybridization was 
used to assay the spatial and temporal pattern of Sonic expression in the limb bud. Sonic 
expression is not detected until stage 17, at the initiation of lunb bud formation, at which 

35 tune it is weakly observed in a punctate pattern reflecting a patchy expression in a few cells. 
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From that point onAvards the Sonic expression pattern exactly matches the location of the 
ZPA, as determined by Honig, L.S. et al., (1985) J. Embryol exp. Morph. 87: 163-174, both 
in position and in intensity of expression. 

(Hi) Induction Of Sonic Expression By Retinoic Acid 

5 A source of retinoic acid placed at the anterior margin of the limb bud will induce 

ectopic tissue csqable causing mirror-image duplications (Sunmierbell, D. et al., (1983) In 
Limb Development and Regeneration (N.Y.: Ala R. Liss) pp. 109-118; Wanek, N. et al., 
(1991) iVa/ure 350: 81-83). The induction of this activity is not an immediate response to 
retinoic acid but rather takes approximately 18 hours to develop (Wanek, N. et al., (1991) 

10 Nature 350: 81-83). When it does develop, the polarizing activity is not found surrounding 
the implanted retinoic acid source, but rather is found distal to it in the mesenchyme along 
the margin of the limb bud (Wanek, N. et al., (1991) Nature 350: 81-83). 

IfSordc esqxession is truly indicative of ZPA tissue, then it should be mduced in the 
ZPA tissue which is ectopically induced by retinoic acid. To test this, retinoic acid-soaked 

1 5 beads were implanted in &e anterior of limb buds and &e expression of Sonic after various 
lengths of time usii^ whole-mount in situ hybridization was assayed. As the limb bud 
grows, tiie bead remains imbedded proximally in tissue vAnch begins to differentiate. 
Ectopic Sonic e^qnession is first detected in the mesenchyme 24 hours after bead 
inq>lantation. Tliisejqjression is found a short distance from tiie distal edge of the bead. By 

20 36 hours Sonic is strongly raqnessed distal to the bead in a stripe just under the anterior 
ectoderm in a mirror-unage pattern relative to tiie endogenous Sonic expression in the 
posterior of the limb bud. 

■(iv) Effects Of Ectopic Expression Of Sonic On Limb Patterning 

The normal expression pattern of Some, as well as that induced by retinoic acid, is 
25 consistent with Sonic being a signal produced by the ZPA. To determine whether Sonic 
expression is sufficient for ZPA activity, the gene was ectopically expressed witiiin the limb 
bud. In most of the experiments we have utilized a variant of a replication-competent 
retroviral vector called RCAS (Hughes, S.H. et al., (1987) J. Virol. 61: 3004-3012)) both as a 
vehicle to introduce the Sonic sequences into diick cdls and to drive their esqsression. The 
30 feet that there exists subtypes of avian retroviruses which have host ranges restricted to 
particular strains of chickens was taken advantage of to control the r^on infected with the 
Sonic/RCAS virus (Weiss, R. (et al.) (1984) Tumor Viruses, Vol. 1 Weiss et al. eds., 
(N.Y.: Cold Spring Harbor Laboratories) pp. 209-260); Fekete, D. et al., (1993a) Mol. Cell. 
Biol. 13: 2604-2613). Thus a vector with a type E envelope protein (RCAS-E, Fekete, D. et 
35 al., (1993b) Proc. Natl. Acad Set USA 90: 2350-2354) is unable to infect the cells of the 
SPAFAS outbied chick embryos routinely used in our lab. However, RCAS-E is able to 
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infect cells from chick embryos of line 15b. In the majority of experiments, primary chick 
embryo fibroblasts (CEFs) prepared from line 15b embryos in vitro were infected. The 
infected cells were pelleted and implanted into a slit made in the anterior of S-SFF host limb 
buds. Due to the restricted host range of the vector, the infection was thus restricted to the 
5 graft and did not spread through the host limb bud. 

To determine the £ate of cells implanted and to control for any effect of the implant 
pix>cedure, a control RCAS-E vector expressing human placental alkaline phosphatase was 
used. Alkaline phosphatase expression can be easily monitored histochemically and the 
location of infected cells can thus be conveniently followed at any stage. Within 24 hours 
10 following implantation &e cells are dispersed proximally and distally within the anterior 
margin of the limb bud. Subsequently » cells are seen to disperse throughout fte anterior 
portion of the limb and into the flank of tiie embryo. 

Limb buds grafted with alkaline phosphatase expressing cells or uninfected cells give 
rise to limbs with structures indistinguishable from unoperated wild type limbs. Such limbs 

15 have the characteristic anterior-to-posterior digit pattem 2-3-4. ZPA grafts give rise to a 
variety of patterns of digits depending on the placement of the graft within the bud (Tickle, 
C. et al., (1975) Nature ISA: 199-202) and the amount of tissue engrafted (Tickle, C. (1981) 
Nature 289: 295-298). In some instances the result can be as weak as the duplication of a 
single digit 2. However, in optimal cases the ZPA graft evokes the production of a fiill 

20 mirror image duplication of digits 4-3-2-2-3-4 or 4-3-2-3-4 (see Figure 8). A scoring system 
has been devised which rates &e effectiveness of polarizing activity on the basis of the most 
posterior digit duplicated: any graft which leads to &e development of a duplication of digit 4 
has been defined as reflecting 100% polarizing activity (Honig, L.S. et al., (1985) J. Embryol 
Exp. Morph 87: 163-174). 

25 Grafts of 1 5b fibroblasts expressing Sonic resulted in a range of ZPA-like phenotypes. 

In some instances the resultant limbs deviate from the wild type solely by the presence of a 
mirror-imiage duplication of digit 2... The most common digit phehotype resulting from 
grafting 5pnic-infected CEF cells is a mirror-image duplication of digits 4 and 3 with digit 2 
missing: 4-3-3-4. In many such cases the two central digits appear ftised in a 4-3/3-4 pattern. 

30 In a number of the cases the grafts induced full mirror-image duplications of the digits 
equivalent to optimal ZPA grafts 4-3-2-2-3-4. Besides the digit duplications, the ectopic 
expression of Sonic also gave rise to occasional duplications of proximal elements including 
the mdius or ulna, the humerus and the coracoid. While these proximal phenotypes are not 
features of ZPA grafts, they are consistent with an anterior-to-posterior respecification of cell 

35 £ate. In some instances, most commonly when the radius or ulna was duplicated, more 
complex digit patterns were observed. Typically, an additional digit 3 was formed distal to a 
di^licated radius. 
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The mirror-image duplications caused by ZPA grafts are not limited to skeletal 
elements. For example, featho" buds are normally present only along the posterior edge of 
the limb. Limbs exhibiting mirror-image duplications as a result of ectopic Sonic expression 
have feather buds on both their anterior and posterior edges, similar to those observed in ZPA 
S grafts. 

While ZPA grafts have a powerfiil ability to alter limb pattern when placed at the 
anterior margin of a limb bud, they have no effect when placed at the posterior margm 
(Saunders, J.W, ct al., (1968) EpitheliaUMesenchymal Interaction, Fleischmayer and 
Billingham, eds. (Baltimore: Wlliams and Wilkins) pp. 78-97). Presumably, the lack of 
10 posterior effect is a result of polarizing activity ahieady being present in that region of the 
bud. Consistent with this, grafts of Sonic expressing cells placed in the posterior of Ijmb 
buds never result in changes in the number of digits. Some such grafts did produce 
distortions in the shape of limb elements, the most conunon being a slight posterior curvature 
in the distal tips of digits 3 and 4 when compared to wild type wings. 

1 5 (v) Effect Of Ectopic Sonic Expression On Hoxd Gene Activity 

The correct expression of Hoxd genes is part of the procesfs by which specific skeletal 
elements are determined (Morgan, B.A. et al., (1993) Nature 358: 236-239). A transplant of 
a ZPA into the anterior of a chick limb bud ectopically activates sequential transcription of 
Hoxd genes m a pattOTi which nurrors tiie normal sequence of Hoxd gene «pression (Nohno, 
20 T. et al., (1991) Cell 64: 1197-1205; Izpisua-Behnonte, J.C et al., (1991) Nature 350: 585- 
589). Since ectopic Sonic expression leads to the same pattern duplications as a ZPA graft, 
we reasoned that Sonic would also lead to sequential activation of Hoxd genes. 

To test this hypothesis, anterior buds were injected with Sonic/RCAS-A2, a virus 
which is capable of directiy infecting the host strains of chicken embryos. This approach 

25 does not strictiy limit the region e)q»essing Sonic (being only moderately controlled by the 
timing, location and titer of viral injection), and thus might be e7q)ected to give a more 
variable resuU. However, experiments testing the kmetics of vhal spread in mfected limb 
buds indicate that infected cells remain localized near tiie anterior margin of the bud for at 
least 48 hours. Hoxd gene expression was monitored at various times post infection by 

30 whole mount in situ hybridization. As expected, these genes are activated in a mirror-image 
pattern relative then- expression in the posterior of control limbs. For example, after 36 hours 
Hoxd-13 is expressed in a mirror-image symmetrical pattem in the broadened distal region of 
infected limb buds. Similar results were obtained with other Hoxd genes (manuscript in 
preparation). 
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Example 4 

A Ftmctionalty Conserved Homolog ofDrosophila Hedgehog is Expressed in 
Tissues With Polarizing Activity in Zebrofish Embryos 

(i) Eaqperimental Procedures 
Cloning and Sequencing 

Approximately 1.5 x 10^ plaques of a 33h zebrafish embryonic Xgtl 1 cDNA Ubrary 
were screened by plaque hybridization at low stringency (McGinnis, W. et al., (1984) Nature 
308: 428-433) using a mix of two sequences as a probe: a Drosophila hh 40Dbp EcoRI 
fragment and a murine Ihh 264bp BamHI-EcoRI exon 2 fragment Four clones were isolated 
) and subcloned into the EcoRI sites of pUClS T3T7 (Pharmacia). Both strands of clone 8.3 
were sequenced using nested deletions (Pharmacia) and internal oUgonucleotide primers. 
DNA sequences and derived amino acid sequences were analyzed using "Geneworks" 
(IntelUgenetics) and the GCG software packages. 

PCR amplification 

15 Degenerate oligonucleotides «r5.i (SEQroNo:30)and /i«33 (SEQIDNo:31) were 

used to amplify gaiomic zebrafish DNA 

hh 5.1: AG(CA)GITG(CT)AA(AG)GA(AG)(CAXAG)I(GCDIAA 
3.3: CTCIAaGCIA(GA)ICK(GT)IGCIA 

PCR was performed with an initial denaturation at 94'C followed by 35 cycles of 47»C for 1 
20 min, 72*C for 2min and 94»C for 1 min with a final extension at 72«»C. Products were 
subcloned in pUCl 8 (Pharmacia). 

In Situ Hybridization 

In situ hybridizations of zebrafish embryos were performed as described in Oxtoby, 
E. et al., (1993) Nuc. Acids REs. 21: 1087-1095 with the following modifications: Embryos 
25 were rehydrated in ethanol rather than methanol series; the proteinase K digestion was 
reduced to 5 min and subsequent washes were done in PBTw without glycine; the antibody 
was preadsorbed in PBTw, 2mg/ml BSA without sheep serum; and antibody incubation was 
performed in PBTw, 2mg/ml BSA. DrosopMla embryos were processed and hybridized as 
previously described. 

30 Histology 

Stained embryos were dehydrated through ethanol.butanol series, as previously 
described (Godsave, S.F. et al., (1988) Development 102: 555-566), and embedded in 
Fibrowax. 8^m sections were cut on an Anglian rotary microtome 
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UNA Probe Synthesis 

For analysis of Shh e9q>ression» two different templates were used with consistent 
results; (i) phh[c] 8.3 linearized with Bgl 11 to transcribe an antisense RNA probe that 
excludes the conserved region, and (ii) p/i/i[c] 8.3 linearized with Hind III to transcribe an 
5 antisense RNA that covers the complete cDNA. All in situ hybridizations were performed 
with the latter probe which gives better signal. Other probes were as follows: Axid Dral- 
linearized p6TlN (StrShle, U. et al., (1993) Genes & Dev. 7: 1436-1446) using T3 RNA 
polymerase, gsc linearized with EcoRl and transcribed with T7: pax 2 Bam HI*linearized 
pcF16 (Krauss, S. et al., (1991) Development 113: 1193-1206) using T7 RNA polymerase. 
10 In situ hybridizations were performed using labelled RNA at a concentration of 1 ng/ml final 
concentration. Antisense RNA probes were transoibed according to the manu&cturer's 
protocol (DIG RNA Labelling Kit, BCL). 

Zebrcffish Strains 

Wild type fish were bred fi^om a foimder population obtained firom the Goldfish Bowl, 
15 Oxford. The mutant cyclops strain bl6 and the mutant notail strains bl60 and bl9S were 
obtained fi'om Eugene, Oregon. Fish were reared at 28^C on a 14h light/lOh dark cycle. 

RNA Injections 

The open reading fi:ame of Shh was amplified by PGR, using oligonucleotides 
5^-CTGCAGGGATCCACCATGCGGCnTrGACGAG-3' (SEQ ID No:32), which contains 

20 a consensus Kozak sequence for translation initiation, and 5'-CTGCAGGGATC- 
CTTATTCCACACGAGGGATT-3' (SEQ ID No:33), and subcloned into the Bglll site of 
pSP64T (Kreig, P.A. et al., (1984) Nuc Acids Res. 12: 7057-7070). This vector includes 5* 
and y untranslated Xenopus P-Globin sequences for RNA stabilization and is conunonly 
used for RNA injections experiments in Xenopus. In vitro transcribed Shh RNA at a 

25 concentration of approximately 100 \ig/m\ was injected into a single cell of naturally 
spawned zebrafish embryos at one-cell to 4-cell stages using a pressure-pulsed Narishige 
microinjector. The injected volume was within the picolitre range. Embryos were fixed 20 
to 27 hrs after injection in BT-Fix (Westerfield, M. (1989) 77ie Zebrafish Book, (Eugene: The 
University of Oregon Press)) and processed as described above for whole*mount in situ 

30 hybridizations with the axial probe. 

Transgenic Drosophila 

An EcoRl firagment, containing the entire Shh ORF, was purified fix)m the plasmid 
p^Xt[c]8.3 and ligated with phosphatased EcoRl digested transformation vector pCaSpeRhs 
(Thummel, C.S. et al., (1988) Gene 14: 445-456). The recombinant plasmid, pHS Shh 
35 containing the Shh ORF in the correct orientation relative to the heat shock promoter, was 
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selected follovong restriction en^roe analysis of miniprep DNA fxam transformed colonies 
and used to transfonn Drosophila embryos using standard microinjection procedures 
(Roberts, D.B. (1986), Drosophila. A Practical Approach, Roberts, D.B., ed., (Oxford: IRL 
Press) pp. 1-38). 

5 Ectopic Expression In Drosophila Embryos 

Embryos carrying the appropriate transgenes were collected over 2 hr intervals, 
transferred to thin layers of 1% agarose on glass microscope slides and incubated in a plastic 
Petri dish floating in a water bath at 3TC for 30 min intervals. Following heat treatment, 
embryos were returned to 25''C prior to being fixed for in situ hybridizatiori with DIG 
10 labelled single stranded Shh, wg or ptc RNA probes as previously described (Ingham et al., 
(1991) Cwr. Opin. Genet. Dev. 1: 261-267). 

(ii) Molecular Cloning OfZebrafish Hedgehog Homologues 

In an initial attempt to isolate sequences homologous to Drosophila Wj, a zebrafish 
genomic DNA library was screened at reduced stringency with a partial cDNA, WiPCR4.1, 

15 corresponding to the first and second exons of the Drosophila gene (Mohler, J. et al., (1992) 
Development 115: 957-971). This screen proved unsuccessful; howeva, a sunilar screen of a 
mouse genomic liteary yielded a single clone with significant homology to AA., subsequoitly 
designated Bih. A 2641^ BamHl-EcoRI fiagment from Has lambda clone containing 
sequences homologous to the second exon of the Drosophila gene was subcloned and, 

20 together witii the Drosophila partial cDNA fiagment, used to screen a Xgtl 1 zebrafish cDNA 
library that was prepared &om RNA extracted fi»m 33h old embryos. This soreen yielded 
four clones with overhtf^g inserts the longest of which is 1.6kb in length, herein referred to 
asSWi(SEQIDNo:5). 

Ciii) A Family OfZebrafish Genes Homologous To The Drosophila Segment Polarity Gene, 
25 Hiidgehog 

Alignment of the predicted zunino acid sequences of Shh (SEQ ID No: 12) and hh 
(SEQ ID No:34) revealed an identity of 47%, confirming that SWj is a homolog of the 
Drosophila gene. A striking conservation occurs within exon 2: an 80 amino acid long 
domam shows 72% identity between Shh and Dros-HH. (Figure 9A). This domain is also 
30 highly conserved in all M-related genes cloned so far and is therefore likely to be essential to 
the fimction of hh proteins. A second domain of ^proximately 30 amino acids close to the 
carboxy-terminal end, though it shows only 61% ammo-acid identity, possesses 83% 
similarity between Shh and hh when allowing for conservative substitutions and could also, 
therefore, be of functional importance (Figure 9B). Although putative sites of post- 
35 translational. modification can be noted, their position is not conserved between Shh and hh. 



wo 95/18856 



PCTAJS94/14992 



- Lee, JJ- et al,, (1992) Cell 71: 33-50, identified a hydrophobic stretch of 21 amino 
acids flanked dovoistream by a putative site of signal sequence cleavage (predicted by the 
algorithm of von Heijne, G. (1986) Nuc. Acids Res. 11) close to the amino-terminal end of 
hh. Both the hydrophobic stretch and the putative signal sequence cleavage sites of hh, 
5 which suggest it to be a signaling molecule, are conserved in Shh. In contrast to hh, Shh does 
not extend N-terminaliy to the hydrophobic stretch. 

Using degenerate oligonucleotides corresponding to amino-acids flanking the domain 
of high homology between Dros-HH and mouse Ihh exons 2 described above, fragments of 
tiie expected size were amplified from zebrafish genomic DNA by PCR. After subcloning 

10 and sequencing, it appeared that tiiree different sequences were amplified, all of which show 
high homology to one another and to Dros-HH (Figure 10). One of diese corresponds to Shh 
therein refen^ to as 2-hh(a) (SEQ ED No:16) and 2hh(b) (SEQ ID No:17), while the other 
two represent additional zebrafish hh homologs (SEQ ID No:5). cDNAs corresponding to 
one of these additional homologs have recratly been isolated, confirming that it is 

1 5 transcribed. Therefore, Shh represrats a member of a new vertebrate gene family. 

(iv) Shh Expression In The Developing Zebrafish Embryo 
Gastrula stages 

Shh expression is first detected at around the 60% epiboly stage of embryogenesis in 
the dorsal mesoderm. Transcript is present in a triangular shaped area, corresponding to the 

20 embryonic ^eld, the equivalent of the amphibian oiganizer, and is restricted to the inner cell 
layer, the hypoblast. During gastrulation, presumptive mesodermal cells involute to form the 
hypoblast, and converge towards the fiiture axis of the embryo, reaching the animal pole at 
approximately 70% epiboly. At this st^e, Shh -expressing cells extend over the posterior 
third of the axis, and the signal intensity is not entirely homogeneous, appearing stronger at 

25 the base than at the apex of the elongating triangle of cells. 

This early spatial distribution of Shh transcript is reminiscent of that previously 
described for axial^ a forkhead-Klated gene; however, at 80% epiboly, axial expression 
extends further towards the animal pole of the embryo and we do not see Shh expression in 
the head area at these early developmental stages. 

30 By 100% epiboly, at 9.5 hours of development, the posterior tip of the Shh expression 

domain now constitutes a continuous band of cells that extends into the head. To determine 
the precise anterior boundary of Shh expression, embryos were simultaneously hybridized 
with probes of Shh and pax-2 (previously pax[b]), the early expression domain of vMch 
marks the posterior midbrain (Krauss, S. etal. (1991) Development 113: 1193-1206). By this 

35 stage, the anterior boundary of the Shh expression domain is positioned in the centre of the 



wo 95/18856 



PCTAJS94/14992 



animal pole and coincides approximately with that of axiai At the same stage, prechordal 
plate cells expressing the homeobox gene goosecoid (gsc) overlap and underlay the 
presumptive forebram (Statchel, S.E. et al., (1993) Development 117: 1261-1274). Whereas 
axial is also thought to be expressed in head mesoderaial tissue at this stage, we cannot be 
certam whether Shh is e3q>ressed in the same cells. Sections of stained embryos suggest that 
in the head Shh may by this stage be e3q)ressed exclusively in neuroectoderaial tissue. 

(v) Somitogenesis 

By the onset of somitogenesis (approximately 10.5h of development), Shh expression 
in the head is clearly restricted to the ventral floor of the brain, extendmg fix)m the tip of the 
diencephalon caudally through the hindbrain. At this stage, expression of axial has also 
disappeared from the head mesoderm and is similarly restricted to the floor of the brain; in 
contrast to Shh, however, it extends only as far as the anterior boundary of the midbrain. At 
this point, gsc expression has become very weak and is restricted to a ring of cells that appear 
to be migrating away from the dorsal midline. 

As somitogenesis continues, Shh expression extends in a rostral-caudal progression 
throughout the ventral region of the central nervous system (CNS). Along the spinal cord, 
the exprrasion domain is restricted to a single row of cells, the floor plate, but gradually 
broadens in the hindbrain and midbrain to become 5-7 cells in diameter, with a triangular 
shewed lateral extension in the ventral diencephalon and two strongly staining bulges at the 
tip of the forebrain, presumably in a region fated to become hypothalamus. 

As induction of Shh in the floor plate occurs, repression in the underlying mesoderm 
begins to fade away, in a sunilar manner to axial (StrShle, U. et al., (1993) Genes & Dev. 7: 
1436-1446). This downregulation also proceeds in a rostral to caudal sequence, coinciding 
with the changes in cell shape that accompany notochord differentiation. By the 22 somite 
stage, mesodermal Shh expression is restricted to the caudal region of the notochord and in 
the expanding tail bud where a bulge of undiflFerentiated cells contmue to express Shh at 
relatively high levels. Expression in tiie midbrain broadens to a rhombic shaped area; 
cellular rearrangements that lead to the 90*" kink of forebrain structures, position 
hypothalamic tissue underneath the ventral midbrain. These posterior hypothalamic tissues 
do not express Shh. In addition to Shh expression in the ventral midbrain, a narrow stripe of 
expressing cells extends dorsally on either side of the third ventricle from the rostral end of 
the Shh domain in the ventral midbrain to the anterior end of, but not including, the 
epiphysis. The most rostral Shh expressing cells are confined to the hypothalamus. In the 
telencephalon, additional Shh expression is initiated m two 1-2 cell wide stripes. 

By 36 hours of development, Shh expression in the ventral CNS has undergone 
further changes. While expression persists in the floor plate of the tailbud, more rostrally 
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located floor plate cells in the spinal cord cease to express the gene. In contrast, in the 
hindbrain and forebrain Shh expression persists and is further modified. 

At 26-28h, Shh expression appears in the pectoral fin primordia, that are visible as 
placode like broadenings of cells underneath the epithelial cell layer that covers the yolk. By 
5 33 hrs of development high levels of transcript are present in the posterior margin of the 
pectoral buds; at the same time, expression is initiated in a narrow stripe at the posterior of 
the first gill. E9q>ression continues in tiie pectoral fin buds in lateral cells m &e early larva. 
At this stage, Shh transcripts are also detectable in cells adjacent to the lumen of the foregut 

(vx) Expression Of Shh In Cyclops And Notail Mutants 

10 Two mutations affecting the dififerentiation of the Axial tissues that express Shh have 

been described in zebrafish embryos homozygous for the cyclops (eye) mutation lack a 
differentiated flooiplate (Hatta, K. et al., (1991) Nature 350: 339-341). By contrast, 
homozygous notail (ntl) embryos are characterized by a fidlure in notochord maturation and a 
disruption of normal development of tail structures (Halpem, M.E. et al., (1993) Cell 75: 99- 

15 111). 

A change in Shh e}qpression is apparent in eye embryos as early as tiie end of 
gastrulation; at this stage, the anterior limit of expression coincides precisely vnih the two 
pax-l stripes in the posterior midbrain. Thus, in contrast to wild-type embiyos, no Shh 
e>q)ression is detected in midline structures of the midbrain and forebrain. By the 5 somite 
20 stage, Shh transcripts are present in the notochord which at this stage extends until 
rhombomere 4; however, no e?q)ression is detected in more anterior structures. Furthermore, 
no Shh e}q>ression is detected in the ventral neural keel, in particular in the ventral portions of 
the midbrain and forebrain. 

At 24 hours of development, the morphologically visible eye phenotype consists of a 
25 fusion of the eyes at the midline due to the complete absence of the ventral diencephalon. As 
at earlier developmental stages, Shh expression is absent from neural tissue. Shh expression 
in the extending tail bud of vsdld-type embzyos is seen as a single row of floor plate cells 
throughout the spinal cord. In a <yc mutant, no such Shh induction occurs in cells of the 
ventral spinal cord with the exception of some scattered cells that show transient expression 
, 30 near the tail. Shnilarly, no Shh egression is seen rostrally in tiie ventral neural ^be. 
However, a small groiq> of Shh expressing cells is detected underneath the epiphysis which 
presumably correspond to the dorsal-most group of Shh e3q>ressing cells in the diencephalon 
of wild-type embryos. 

In homozygous notail (ntl) embryos, no Shh staining is seen in mesodermal tissue at 
35 24 hours of development, consistent with the lack of a notochord in these embryos; by 
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contrast, expression throughout the ventral CNS is unaffected. At the tail bud stage, 
however, just prior to the onset of somitogenesis, Shh expression is clearly detectable in 
notochord precursor cells. 

(vii) Irgection Of Synthetic Shh Transcripts Into Zebrqfish Embryos Induces Expression Of A 
5 Floor Plate Marker 

To investigate the activity of Shh in the developing embryo, an over-expression 
strategy, similar to that employed in the analysis of gene function in Xenopus, was adopted. 
Newly fertilized zebrafish eggs were inj ected with synthetic Shh RNA and were fixed 1 4 or 
24 hours later. As an assay for possible changes in cell fete consequent upon the ectopic 
10 activity of Shh, we decided to analyze Axial expression, since this gene serves as a marker for 
cells in which Shh is normally expressed. A dramatic, though highly localized ectopic 
expression of Axial in a significant proportion (21/80) of the injected embryos fixed after 24 
hours of development is observed. Affected embryos show a broadening of the Axial 
expression domain in the diencephalon and ectopic Axial expression in the midbrain; 
15 however, in no case has ectopic expression in the telencephalon or spinal cord been observed. 
Many of the injected embryos also showed disturbed forebrain structures, in particular 
smaUer ventricles and poorly developed eyes. Amongst embryos fixed after 14h. a snnilar 
proportion (8/42) exhibit the same broadening and dorsal extension of the Axial stripe in the 
diencephalon as well as a dorsal extension of Axial staining in the midbrain; again, no 
20 changes in Axial expression were observed caudal to the hindbrain with the exception of an 
increased number of expressing cells at the tip of the tail. 

(vtti) Overexpression Of Shh In Drosophila Embryos Activates The hh-Dependent Pathway 

In order to discover whether the high degree of structural homology between the 
Drosophila and zebrafish hh genes also extends to the functional level, an overexpression 
25 system was used to test the activity of Shh in flies. Expression of Dros-HH driven by the 
HSP70 promoter results in the ectopic activation of both the normal targets of W2 activity; tiie 
wg transcriptional domain expands to fill between one third to one half of each parasegment 
whereas ptc is ectopically activated in all cells except those expressing en (Ingham, P.W. 
(1993) Nature 366:560-562). To compare die activities of die fly and fish genes, flies 
30 transgenic for AUSShh construct were generated described above and subjected to the same 
heat shock regime as H 5W. transgenic flies. HS Shh embryos fixed immediately after the 
second of two 30 min heat shocks exhibit ubiquitous transcription of the Shh cDNA. 
Similarly treated embryos were fixed 30 or 90 mm after the second heat shock and assayed 
for wg or ptc transcription. Both genes were found to be ectopically. activated m a similar 
35 manner to tiiat seen in heat shocked H Shh embryos; thus, the zebrafish Shh gene can activate 
the same pathway as the endogenous hh gene. 
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Cloning, Expression and Localization of Human Hedgehogs 

(i) Experimental Procedures 
5 Isolation of human hedgehog cDNA clones. 

Degenerate nucleotides used to clone chick Shh (Riddle et al, (1993) Cell 75:1401- 
1416) were used to amplify by nested PGR human genomic DNA. The nucleotide sequence 
of these oligos is as follows: 

10 vHH5O:5'^AATTCCCAG(CA)GITG(CI0AA(AG)GA(AG)(CA){AG)I^^^ 
(SEQIDN0:18); 

vHH30:5'-TCATCGATGGACCCA(GA)TC(GA)AAICCIGC(TC)TC-3' (SEQ ID N0:19); 
vHH3I:5'-^TCTAGAGCTCIACIGClA(GA)IC(GT)IGGIA-3' (SEQ ID NO:20) 

IS The e)q>ected 220 bp PGR product was subcloned mto pGEM7zf (Promega) and 

sequenced using Sequenase v2.0 (U.S. Biochemicals). One clone showed high nucleotide 
similarity to mouse Ihh and mouse Shh sequence (Echelard et al., (1993) Cell 75:1417-1430) 
and it was used for screenmg a human fetal lung 5 -stretch plus cDNA library (Glontech) in X 
gtlO phage. The library was screened following the protocol suggested by the company and 

20 two positive plaques were identified, ptuified, subcloned into pBluescript SK+ (Stratagene) 
and sequenced, identifying them as the human homologues of Shh (SEQ ID N0:6) and Ihh 
(SEQIDNO:7), 

One clone contained the full coding sequence of a human homolog of Shh as well as 
150 bp of 5' and 36 bp of 3' untranslated sequence. The other clone, which is the human 

25 homolog of Ihh, extends fix>m 330 bp 3* of the coding sequence to a point close to the 
predicted boundary between the first and second exon. The identity of these clones was 
determined by comparison to the murine and chick genes. The protein encoded by human 
Shh has 92.4% overall identity to the mouse Shh, including 99% identity m the amino- 
terminal half The carboxyl-terminal half is also highly conserved, although it contains short 

30 stretches of 1 6 and 1 1 amino acids not present in the mouse Shh, The human Ihh protein is 
96.8% identical to tiie mouse Ihh. The two predicted human protems are also highly related, 
particularly in their amino*terminal halves where they are 91.4% identical. They diverge 
significantly in their carboxyl halves, v^ere they show only 45.1% identity. The high level 
of similarity in the amino portion of all of &ese proteins implies that this region encodes 

35 domams essential to the activity of this class of signaling molecules. 
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Northern blotting 

Multiple Tissue Northern Blot (Clontech) prepared from poly A+RNA isolated from 
human adult tissues was hybridized with either full length ^^P-labeled hiiman Shh clone or 
S ^^P-labeled human Ihh clone following the protocol suggested by the company. 

Digoxigenin in situ hybridization. 

Sections: tissues from normal human second trimester gestation abortus specimens 
were washed in PBS and fixed overnight at 4®C paraformaldehyde in PBS, equilibrated 24 

10 hours at 4°C in 50% sucrose in PBS and then placed in 50% sucrose in oct for one hour 
before embedding in oct Cryostat sections (10-25 mm) were collected on superfrost plus 
slides (Fisher) and frozen at -SO^^C until used. Following a postfixation in 4% 
paraformaldehyde the slides were processed as in Riddle et al., (1993) Cell 75:1401-1416 
with the following alterations: proteinase K digestion was performed at room temperature 

15 from 1-15 minutes (depending on section thickness), prehybridization, hybridization and 
washes time was decreased to 1/10 of time. 

Whole-mounts: tissues from normal second trimester hunum abortus specimens were 
washed in PBS, fixed overnight at 4^C in 4% paraformaldehyde in PBS and Hien processed 
as in Riddle et al., (1993) Cell 75:1401-1416. 

20 

Isolation of an Shh PI clone. 

The human Shh gene was isolated on a PI clone from a PI libraiy (Pierce and 
Sternberg, 1992) by PGR (polymerase chain reaction) screening. Two oligonucleotide 
primers were derived from tiie human Shh sequence. The two olignucleotide primers used for 
25 PGR were: 

SHHF5'-AGGGAGGGCTGGGAGGAAGATGGG-3' (SEQ ID NO:43) 
SHR5'-CGCTGGGTGGTAGGGGATGAAGGAG-3' (SEQ ID NO:44) 
The PGR reaction was carried using standard conditions as described previously (Thierfelder 
et al., 1994) except that the annealing tempemture was 65^G. These primers amplified a 1 19 

30 bp fi:agment fix>m human and PI clone DNA. The PI clone was designated SHHPl. After 
tiie PI clone was isolated these oligonucleotides were used as sequencing primers. A 
2.5Kb£coRI fragment that encoded a CA repeat was subcloned from this PI clone using 
methods described previously (Thierfeldo: et al. 1994). Oligonucleotide primers that 
amplified this CA repeat sequence were fashioned from the flanking sequences: 

35 SHHCAF5'-ATGGGGATGTGTGTGGTCAAGTGTA-3' (SEQ ID NO:45) 
SHHCAR5'.TTCACAGACTCTCAAAGTGTATTTT-3' (SEQ ID NO:46) 
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Mapping the human Ihh and Shh genes. 

The human Ihh gene was mapped to chromosome 2 using somatic cell hybrids from 
NIGMS mapping pannel 2 (GM10826B). 
5 The gene was mapped to chromosome 7 using somatic cell hybrids from NIGMS 

mapping panel 2 (GM10791 and GM10868). 

Linkage betweoi the limb deformity locus on chromosome 7 and the Shh gene was 
demonstrated using standard procedures. Family LD has been desoibed previously 
(Tkukuiov et al, (1994) Nature Genet. 6:282-286). A CA repeat bearing sequence near the 
10 SMi gene was amplified from the DNA of all members of Family LD by PGR using the 
SHHCAF and SHHCAR primos. Linkage between the CA repeat and the LD disease gene 
se^«gating in Family LD was estimated by the MLINK program (Oct, 1967). Penetrance 
was set at 100% and the allele frequencies were determined using unrelated spouses in the 
LD fronily. 

15 

Interspecific Backcross Mapping. 

Interspecific backcross progeny were generated by mating (C57BL/6J x M. spretus) 
Fl females and C57BL/6J males as described (Copeland and Jenkins, (1991) Trends Genet. 
7; 113-11 8). A total of 205 N2 mice were used to map the Ihh and Dhh loci. DNA isolation, 

20 restriction en^one digestions, agarose gel electrophoresis. Southern blot transfer and 
hybridization were performed essentially as described (Jenkins et al., (1982) J. Virol 43:26- 
36). All blots were prepared with Hybond-N+ nylon membrane (Amersham). Hie probe, an 
~ 1 .8kb EcoKl fiagment of mouse cDNA, detected a major fiagment of 8.5 kb in C57BL/6j 
(B) DNA and a major fragmrat 6.0 kb in M. spretus (S) DNA following digestion with BglH. 

25 The Shh probe, an ~ 900 bp Smal fragment of mouse cDNA, detected HincU fragments of 7.5 
and 2.1 kb (B) as well as 4.6 and 2.1 (S). The Dhh probe, and ~ 800 bp BamHi/Ecc'Ri 
fragment of mouse genomic DNA, detected migor fragments of 4.7 and 1.3 kb (B) and 8.2 
and 1.3 kb (S) following digestion witii Sphl. The presence or absence of M. spretus specific 
fi-agments was followed in backcross mice. 

30 A description of the probes and RFLPs for lod used to position Ihe Ihh, Shh and Dhh 

loci in the interspecific backcross has been reported. These include: Fnl, Vil and Acrg, 
chromosome 1 (Wilkie et al., (1993) Genomics 18:175-184), Gnail, En2, 116, chromosomes 
5 (Miao et al., (1994) PNAS USA 91:11050-11054) and Pdgfb, Gdcl and Rarg, chromosome 
15 (Brannan et al, (1992) Genomics 13:1075-1081). Recombination distances were 

35 calculated as described (Green, (1981) Linkage, recombination and mapping. In "Genetics 
and Probability in Animal Breeding Experiments'', pp. 77-1 13, Oxford University Press, NY) 
using the conqputer program SPRETUS MADNESS. Gene order was determined by 
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minimizing the number of recombination events required to explain the aUele distribution 
pattons. 

(ii) Expression of Human Shk and Ml 

5 

To investigate the tissue distribution of Shh and Ihh expression, poly(A)+RNA 
samples from various adult human tissues were probed with the two cDNA dones. Of the 

tissues tested, an M-specific message of -2.7 Kb is only detected in liver and kidney. Shh 
transcripts was not detected in the RNA from any oftheaduh tissues tested. Allthesamples 
10 contained approximately equal amounts of intact RNA, as determined by hybridization with a 
control probe. 

The hedgehog femUy of genes were identified as mediators of embryonic patterning 
in flies and vertebiates. No adult expression of these genes had previously been reported. 
These results indicate that Ihh additionally plays a role in adult Uver and kidney. Since the 
15 hedgehog genes encode intercellular signals, Ihh may function in coordmating the properties 
of different cell types in these organs; Shh may abo be used as a signaling-molecule in ±e 
adult, either in tissues not looked at here, or at levels too low to be detected under these 
conditions. 

In situ hybridization was used to investigate the expression of Shh in various mid- 
20 gestational human fetal organs. Shh expression is present predominantiy in endodemn 
derived tissues: the respiratory epitiielium, collecting ducts of tiie kidney, transitional 
eptiielium of tije ureter, hepatocytes, and small intestine epitiielium. Shh was not detectable 
in fetal heart or placental tissues. The intensity of expression is increased in primitive 
differentiating tissues (renal blastema, base villi, branching lung buds) and decreased or 
25 absent in differentiated tissues (e.g. glomeruli). Shh expression is present in tiie mesenchyme 
immediately abutting tiie budding respiratory tubes. The non-uniform pattern of Shh 
expression in hepatocytes is consistent witii expression of otiier genes in adult liver 
(Dingemanse et al. (1994) Differentiation 56:153-162). The base of villi, tiie renal blastema, 
and tiie lung buds are all regions expressing Shh and tiiey are areas of active growtii and 
30 differentiation, suggesting Shh is important in tiiese processes. 

(Hi) The Chromosomal Map Location of Human Shh and Ihh 

Since Shh is known to mediate patterning during tiie development of tiie mouse and 
chick and tiie expression of 5Wi and Ihh are suggestive of a similar role m humans, mutations 
35 in tiiese genes would be expected to lead to embryonic letiiaUty or congenital defects. One 
way of investigating tiiis possibiUty is to see whettier tiwy are genetically linked to any 
known inherited disorders. 
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Shh' and /Tt^-specific primers were designed from their respective sequences and were 
used in PGR reactions on a panel of rodent-human somatic cell hybrids. Control rodent DN A 
did not amplify specific bands using these primers. In contrast, DNA fit>m several rodent* 
human hybrids resulted in PGR products of the appropriate size allowing us to assign Shh to 
5 chromosome 7q and IM2 to chromosome 2. 

One of the central roles of chick Shh is in regulating the anterior-posterior axis of the 
limb. A human congenital polysyndactyly has recently been mapped to chromosome 7q36 
(Tsukurov et al, (1994) Nature Genet 6:282-286; Heutmk et al, (1994) Nature Genet 
6:287-291). The phenotype of this disease is consistent with defects that might be expected 

10 from abexrant expression of Shh in the limb. Therefore, the chromosomal location of Shh 
was mapped more precisely, in particular in relation to the polysyndactyly locus. 

API phage library was screened using the Shh specific primers for PGR amplification 
and clone SHHPl was isolated. Glone SHHPl contained Shh sequence. A Southern blot of 
an £coRI digest of this phage using [CA]/[GT] probe demonstrated that a 2.5 Kb EcoBl 

15 fragment contained a GA repeat. Nucleotide sequence analysis of this subcloned EccIRl 
fi'agment demonstrated that the GA repeat lay near the £coRI sites. Primers flanking the GA 
repeat were designed and used to map the location of Shh relative to other markers on 7q in 
individuals of a large kindred with complex polysyndactyly (Tsukurov et ai, (1994) Nature 
Genet 6:282-286). Shh maps close to D75550 on 7q36, with no recombination events seen 

20 in this study. It is also extremely close to, but distinct fi:om, the polysyndactyly locus with 
one recombination event observed between them (maximum lod score = 4.82, © = 0.05). 
One unaffected individual (pedigree ID V-10 in Tsukurov et al, (1994) Nature Genet 6:282- 
286) has the Shh linked GA repeat allele found in all aiFected frunily members. No 
recombination was observed between the locus En2 and the Shh gene (maximum lod score - 

25 1.82,0 = 0.0). 

(iv) Chromosomal mapping of the Murine Ihh Shh and Dhh genes. 

The murine chromosomal location of Ihh, Shh and Dhh was determined using an 
interspecific backcross mapping panel derived fix)m crosses of [(G57BL/6J x spetrus)¥\ X 
30 G57BL/J)] mice. cDNA fragments from each locus were used as probes in Southern blot 
hybridization analysis of G57BL/6J and M spretus genomic DNA that was separately 
digested with several different restriction enzymes to identify informative restriction 
fragment length polymorphisms (RFLPs) usefixl for gene mapping. The strain distribution 
pattern of each RFLP in the interspecific backcross was then determined by following the • 
* 35 presence or absence of RFLPs specific for M spretus in backcross mice. 

Ihh mapped to the central region of mouse chromosome 1, 2.7 cM distal of Fnl and 
did not recombine with Vil in 190 animals typed in common, suggesting that the two loci are 
within 1 .6 cM (upper 95% confidence level) (Fig. 16). Shh mapped to the proximal region of 
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mouse chromosome 5, 0.6 cM distal of En2 and 1.9 cM proximal of 116 in accordance to 
Chang ^t al., (1994) Development 120:3339-3353. Dhh mapped to the very distal region of 
mouse chromosome 15, 0.6 cM distal of GdcJ and did not recombine with Rarg in 160 
animals typed in common, suggesting that the two loci are within 1 .9 cM of each other (upper 

5 95% confidence level) (Fig. 16). 

Interspecific maps of chromosome 1, 5 and 15 wctc compared with composite mouse 
linkage maps tiiat report fee map location of many uncloned mouse mutations (compiled by 
M.T. Davisson, T.H. Roderick, A.L. Hillyard and D.P. Doolittle and provided Stom GBASE, 
a computerized database maintained at The Jackson Laboratory, Bar Harbor, ME). The 

10 hemimelic extra-toe (Hx) mouse mutant maps 1 .1 cM distal to En2 on chromosome 5 (Martin 
et al, (1990) Genomics 6:302-308), a location in close proximity to v^iiere Shh has been 
positioned . /fir is a dominant mutation v(Mch results in preaxial Polydactyly and hemimelia 
affecting all four limbs (Dickie, (1968) Mouse News Lett 38:24; Knudsen and Kochhar, 
(1981) J. Emhryol Exp. Morph. 65: Suppl. 289-307). Shh has previously been shown to be 

15 expressed in the limb (Echelard etal, (1993) Cell 75:1417-1430). To determine whether Shh 
and Hx are tightly linked we followed their distribution in a backcross panel in vvWch Hx was 
segregating. Two recombinants between Shh and He were identified, thus excluding the 
possibility that the two loci are allelic and these observations are again consistent with those 
of Chang et al, (1994) Development 120:3339-3353. While Ihere are several other mutations 

20 in die vicinity of Ihh and Dhh^ none is an obvious candidate for an alteration in the 
corresponding gene. 

The central region of mouse chromosome 1 shares homology witii human 
chromosome 2q (summarized in Fig. 16). Placement of JWi in this interval suggests the 
human homolog of Jhh will reside on 2q, as well. Similarly, it is likely that human homolog 
25 of Dhh will reade on human chromoscmie 12q. 

Fxample 6 

Proteolytic Processing Yields Two Secreted Forms of Sonic Hedgehog 
(i) Es^erimental Procedures 
30 In vitro Translation and Processing 

Mouse and chick sonic hedgehog codxag sequences were inserted into die vector 
pSP64T (kindly provided by D. Melton) which contains an SP6 phage promoter and both 5' 
and 3' untranslated sequences derived ftom the Xenopus laevis p-Globin gene. After 
restriction endonuclease digestion with Sal I to generate linear templates, RNA was 
35 transcribed in vitro using SP6 RNA polymerase (Promega, Inc.) in the presence of 1 mM cap 
structure analog (m7G(5')ppp(5')Gm; Boehringer-Mannheim, Inc.) Following digestion with 



wo 95/18856 PCTAJS94/14992 

RQl DNase I (Promega, Inc.) to remove the DNA template, transcripts were purified by 
phenolrcholoroform extraction and ethanol precipitation. 

Rabbit reticulocyte lysate (Promega, Inc.) was used according to the manufacturer's 
instructions. For each reaction, 12.5 ^il of lysate was programmed with 0.5-2.0 ng of in vitro 
5 transcribed RNA. The reactions contained 20 ^Ci of Express labeling mix (NEN/DuPont, 
Inc.) were included. To address processing and secretion in vitro, 1.0-2.0 nl of canine 
pancreatic microsomal membranes (Promega, Inc.) were included in the reactions. The fmal 
reaction volume of 25 \i\ was incubated for one hour at 30®C. Aliquots of each reaction 
(between 0.25 and 3.0 ^l) were boiled for 3 minutes in Laemmli sample buffer (LSB: 125 
10 mM Tris-Hcl [pH 6.8]; 2% SDS; 1% 2-mercaptoethanol; 0.25 mg/ml bromophenol blue) 
before separating on a 15% polyacrylamide gel. Fixed gels were processed for fluorography 
using EnHance (NEN/DuPont, Inc.) as described by the manufacturer. 

Glycosylation was addressed by incubation with Endoglycosidase H (Endo H; New 
England Biolabs, Inc.) according to the manufacturer's directions. Reactions were carried out 
15 for 1-2 hr at 37^0 before analyzing reaction products by polyacrylamide gel electrophoresis 
(PAGE). 

Xenopus Oocyte Injection and Labeling 

Oocytes were en2ymatically defolliculated and rinsed with OR2 (50 mM HEPES [pH 
7.2], 82 mM NaCI, 2.5 mM KCl, 1.5 mM Na2HP04). Healthy stage six oocytes were 

20 injected with 30 ng of in vitro transcribed, capped mouse Shh RNA (prepared as described 
above). Following a 2 hr recovery period, healthy injected oocytes and uninjected controls 
were cultured at room temperature in groups of ten in 96-well dishes containing 0.2 ml of 
0R2 (supplemented with 0.1 mg/ml Gentamicui and 0.4 mg/ml BSA) per well. The 
incubation medium was supplemented with SO ^Ci of Eiqptess labeling mix. Three days after 

25 mjection, the culture media were collected and e:q)ression of Shh protein analyzed by 
immunoprecipitation. Oocytes were rinsed several times in 0R2 before lysing in TENT (20 
mM Tris-HCl [pH 8.0]; 150 mM NaCl, 2mM EDTA; 1% Triton-X-lOO; 10 ^1/oocyte) 
supplemented with 1 |iig/ml aprotinin, 2 ^g/ml leupeptin and ImM 
phenyhnethylsidronylfiuoride (PMSF). After centrifugation at 13000 x g for 10 minutes at 

30 4^0, soluble protein supematants were recovered and analyzed by immunoprecipitation (see 
below). 

Cos Cell Transfection and Labeling 

Cos cells were cultured in Dulbecco*s Modified Eagle Medium (DMEM; Sigma, Inc.) 
supplemented with 10% fetal bovine serum (Gibco/BRL), 2 mM L-Glutamine (Gibco/BRL) 
35 and 50 mU/ml penicillin and 50 }xg/ml streptomycin (Gibco/BRL). Subconfluent cos cells in 
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35 mm or 60 mm dishes (Falcon, Inc.) were transiently transfected with 2 mg or 6 mg 
supercoiled plasmid DNA, respectively. Between 42 and 44 hr post-transfection, cells were 
labeled for 4-6 hr in 0.5 ml (35 nmi dishes) or 1.5 ml (60 mm dishes) serum-free DMEM 
lacking Cysteine and Methionine (Gibco/BRL) and supplemented with 125 ^Ci/ml each of 
5 Express labeling mix and L-35S-Cysteine (NEN/DuPont). After labeling, media were 
collected and used for immunoprecipitation. Cells were rinsed with cold PBS and lysed m 
the tissue culture dishes by the addition of 0.5 ml (35 mm dishes) or 1.5 ml (60 mm dishes) 
TENT (with protease inhibitors as described above) and gentle rocking for 30 minutes at 4^C. 
Lysates were cleared by centrifugation (13000 x g for 5 min. at 4^C) and the si^matants 
1 0 were analyzed by immunoprecipitation (see below). 

Baculovirus Production and Infection 

A recombinant baculovirus expressing mouse sonic hedgehog with a myc epitope tag 
insCTted at the carboxy terminus was generated using the Baculogold kit (Pharmingen, Inc.). 
The initial virus production used Sf 9 cells, followed by two rounds of amplification in High 

15 Five cells (Invitrogen, Inc.) in serum-free medium (ExCell 401; Invitrogen, Inc.). A 
baculovirus lacking Shh coding sequences was also constructed as a control. For protein 
induction. High Five cells were infected at a multiplicity of ^)proximately 15. Three days 
later, medium and cells were collected by gentle pipetting. Cells were collected by 
centrifugation (1000 x g) and the medium was recovered for Western blot analysis. Cell 

20 pellets were washed twice in cold PBS and lysed in TENT plus protease inhibitors (see 
above) by rotating for 30 minutes at A^C in a microcentrifuge tube. The lysate was cleared as 
described above prior to Western blotting. 

Western Blotting 

For Western blotting, 0.25 ml samples of media (1% of the total) were precipitated 
25 with TCA and redissolved in 15 ^1 of LSB. Cell lysate samples (1% of total) were brought to 
a final volume of 15 (xl with water and concentrated (5X) LSB . Samples were boiled S 
minutes prior to separation on a 15% acrylamide gel. Proteins were transferred to PVDF 
membrane (Immobilon-P; Millipore, Inc.) and blocked in BLOTTO (5% wA^ non-fiit dried 
milk in PBS) containing 0.2% Tween-20. Hybridoma si5)ematant recognizing the human c- 
30 myc epitope (9E10; Evan, G.L et al., (1985) MoL Cell Biol 5:3610-3616) was added at a 
dilution of 1:200 for one hour followed by a 1:5000 dilution of Goat anti«Mouse-Alkaline 
phosphatase conjugate (Promega, Inc.) for 30 minutes. Bands were visualized using the 
Lumi-Phos 530 reagent (Boehringer-Maimheim) according to the manufacturer's directions. 



35 
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Immunoprecipitation 

Cell lysates (Xenopus oocytes or cos cells) were brought to 0.5 ml with TENT (plus 
protease inhibitors as above). Media samples (0R2 or DMEM) were cleared by 
centrifugation at 13000 x g for 5 min. (4oC) and lOX TENT was added to a final 
5 concentration of IX (final volume: 0.5-1. S ml). The c-myc monoclonal antibody hybridoma 
siq)eniatant was added to 1/20 of the final volume. Samples were rotated for 1 hr at 4^C., 
then 0.1 ml of 10% (v/v) protein A-Sepharose CL-4B (Pharmacia, Inc.) was added. Samples 
were rotated an additional 14-16 h. Immune complexes were washed 4 times with 1.0 ml 
TENT. Immunoprecipitated material was eluted and denatured by boiling for 10 minutes in 
10 25 ^1 IX LSB. Following centrifiigation, samples were separated on 15% acrylamide gels 
and processed for fluoirography as described previously. Samples for Endo H digestion were 
eluted and denatured by boiling for 10 minutes in the provided denaturation buffer followed 
by digestion with Endo H for 1-2 hr at 37oC. Concentrated (SX) LSB was added and the 
samples were processed for electrophoresis as described. 

15 For immunoprecipitation with the anti-mouse SKh serum» samples (Cos cell lysates 

and DMEM) were precleared by incubating 1 hr on ice wi& 3 ^1 pre-unmune serum, 
followed by the addition of 0.1 ml 10% (v/v) Protein A-Sepharose. After rotating for 1 hr at 
4 C, supematants were recovered and incubated for 1 hr on ice wi& 3^1 depleted anti-mouse 
Shh serum (see below). Incubation with Protein A-Sepharose, washing, elution and 

20 electrophoresis wrare tiien performed as described above. 

Immunofluorescent Staining of Cos Cells 

Twenty-four hours after transfection, cells were transfmed to S-chamber slides (Lab- 
Tek, Inc.) and allowed to attach an additional twenty-fow hovxs. Cells were fixed in 2% 
paraformaldehyde/0.1% glutaraldehyde, washed in PBS and permeabilized in 1% Triton-X- 

25 100 (Munro, S. and Pelham, H.R.B., (1987) Cell 48:899-907). After washing m PBS, cells 
were treated for 10 minutes in 1 mg/ml sodium borohydride. Cells were incubated with the 
c-myc monoclonal antibody hybridoma supematant (diluted 1:10) and the affinity purified 
mouse Sonic hedgehog antiserum (diluted 1:4) for 45 minutes followed by incubation in 
1:100 Goat-anti Mouse IgG-RITC plus 1:100 Goat anti Rabbit IgG FITC (Southern 

30 Biotechnology Associates, Inc.) for 45 minutes. DAPI (Sigma, Inc.) was included at 0.3 jx 
g/ml The slides were mounted in Slo-Fade (Molecular Probes, Inc.) and photographed on a 
Leitz DMR compound microscope. 

Antibody Production and Purification 

A PCR fiBgment encoding amino acids 44-143 of mouse Sonic hedgehog was cloned 
35 in fi-ame into the Eco Rl site of pGEX-2T (Pharmacia, Inc.). Transformed bacteria were 
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induced with ff TG and the fusion protein purified on a Glutathione- Agarose affinity column 
(Pharmacia, Inc.) according to the manufacturer's instructions. Inoculation of New Zealand 
White rabbits, as well as test and production bleeding were carried out at Hazelton Research 
Products, Inc. 

To deplete the serum of antibodies against Glutathibne-S-transfenise (GST) and 
bacterial proteins, a lysate of E. coli transformed with pGEX-2T and induced witii IPTG was 
coiqpled to AfB-(jel 10 (Bio-Rad, Inc.) The serum was incubated in batch for two hours with 
the depletion matrix before centrifugadon (1000 x g for 5 min.) and collection of the 
supernatant To make an affinity matrix, purified bacterially expressed protein corresponding 
to the amino temiinal two-thirds of mouse Sonic hedgehog was coupled to Affi-Gel 10 (Bio* 
Rad, Inc.). The depleted antiserum was first adsorbed to this matrix in batch, then transferred 
to a column. The matrix was washed with TEST (25 mM Tris-HCI [pH 7.5], 140 mM NaCl, 
5 mM KCl, 0.1% Triton-X-100), and the purified antibodies were eluted with ten bed 
volumes of 0.15 M Glycine [pH 2.5]. The solution was neutralized with one volume of 1 M 
Tris-HCI [pH 8.0], and dialyzed against 160 volumes of PBS. 

Other antibodies have been graerated against hedgehog proteins and three polyclonal 
rabbit antisera obtained to hh protems can be characterized as foIIows:Ab77 -reacts only v^th 
the carboxyl processed chick Shh peptide (27 kd); Ab79 -reacts with amino processed chick, 
mouse and human Shh peptide (19 kd). Weakly reacts with 27 kd peptide fiom chick and 
mouse. Also reacts with mouse Ihk\ and Ab80 -reacts with only amino peptide (19kd) of 
chick, mouse and human. 

(ii) In Vitro Translated Sonic Hedgehog is Proteotytically Processed and Glycosylated 

The open reading fi:ames of chick and mouse Shh encode primary translation products 
of 425 and 437 amino acids, respectively, with predicted molecular masses of 46.4 
kilodaltons (kDa) and 47.8 kDa (Echelard, Y. et al., (1993) Cell 75:1417-1430; Riddle, R.D. 
et al., (1993) Cell 75:1401-1416). Further examination of tiie protein sequences levealed a 
short stretch of amino terminal residues (26 for chick, 24 for mouse) that are highly 
hydrophobic and are predicted to encode signal peptides. Removal of these sequences would 
generate proteins of 43.7 kDa (chick Shh) and 45.3 kDa (mouse Shh). Also, each protein 
contams a smgle consensus site for N-linked glycosylation (Tarentino, A.L. et al., (1989) 
Methods Cell Biol 32:111-139) at residue 282 (chick) and 279 (mouse). These features of 
the Shh proteins are summarized in Figure 11. 

A rabbit reticulocyte lysate programmed with in vitro translated messenger RNA 
encoding either chick or mouse Shh synthesizes proteins with molecular masses of 46 kDa 
and 47 kDa, respectively, these values are in good agreement with those predicted by 
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examination of the anuno acid sequences. To examine posttranslational modifications of Shh 
proteins, a preparation of canine pancreatic microsomal membranes was included in the 
translation reactions. This preparation allows such processes as signal peptide cleavage and 
core glycosylation. When the Shh proteins are synthesized m the presence of these 

5 membranes, two products with apparent molecular masses of approximately 19 and 28 kDa 
(chick), or 19 and 30 kDa (mouse) are seen in addition to the 46 kDa and 47 kDa forms. 
When the material synthesized in the presence of the membranes is digested witii 
Endoglycosidase H (Endo H), the mobilities of the two larger proteins are mcreased. The 
apparent molecular masses of the Endo H digested forms are 44 kDa and 26 kDa for chick 

10 Shh, and 45kDa and 27 kDa for mouse Shh. The decrease in the molecular masses of the 
largest proteins synthesized in the presence of the microsomal membranes after Endo H 
digestion is consistent with removal of the predicted signal peptides. The mobility shift 
following Endo H treatment indicates that N-linked glycosylation occurs, and that the 26 kDa 
(chick) and 27 kDa (mouse) proteins contain the glycosylation sites. 

IS The appearance of the two lower molecular weight bands (hereafter referred to as the 

'^processed forms") upon translation in the presence of microsomal membranes suggests that 
a proteolytic event in addition to signal peptide cleavage takes place. TThe combined 
molecular masses of the processed fomis (19 kDa and 26 kDa for chick; 19 kDa and 27 kDa 
for mouse) add up to approximately the predicted masses of the signal peptide cleaved 

20 proteins (44 kDa for chick and 45 kDa for mouse) suggesting that only a single additional 
cleavage occins. 

The mouse Shh protein sequence is 12 amino acid residues longer than tilie chick 
sequence (437 versus 425 residues). Alignment of the chick and mouse Shh protein 
sequences reveals that these additional amino acids are near the carboxy terminus of the 
25 protein (Echeiard, Y, et al., (1993) Cell 75:1417-1430). Since the larger of the processed 
forms differ in molecular mass by approximately 1 kDa between the two species, it appears 
that these peptides contain the carboxy terminal portions of the Shh proteins. The smaller 
processed forms, whose molecular masses are identical, presumably consist of the anuno 
terminal portions. 

30 (in) Secretion of Shh Peptides 

To investigate the synthesis of Shh proteins in vivo, the mouse protein was expressed 
m several different eukaryotic cell types. In order to detect synthesized protein, and to 
facilitate future purification, the carboxy terminus was engineered to contain a twenty-five 
amino acid sequence containing a recognition site for the thrombin restriction protease 
35 followed by a ten amino acid sequence derived from the human c-myc protein and six 
consecutive histidine residues. The c-myc sequence serves as an epitope tag allowing 
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detection by a monoclonal antibody (9E10; Evan, G.I. et al., (1985) Mol Cell Biol. 5:3610- 
3616). The combined molecular mass of the carboxy terminal additions is approximately 3 
kDa. 

Xenopus laevis oocytes 

5 Inmiimoprecipitation with the c-myc antibody detects several proteins in lysates of 

metabolically labeled Xenopus laevis oocytes injected with Shh mRNA. Cell lysates and 
medium ftom ^^S labeled oocytes injected with RNA encoding mouse Shh with the c-myc 
epitope tag at the at the carboxy terminus, or from control oocytes were analyzed by 
immunoprecipitation with c-myc monoclonal antibody. A band of approximately 47 kDa is 

10 seen, as is a doublet migrating near 30 kDa. Treatment with Endo H increases the mobility of 
the largest protein, and resolves the doublet into a single species of approximately 30 kDa. 
These observations parallel the behaviors seen in vitro. Allowing for the added mass of the 
carboxy temdnal additions, the largest protein would correspond to the signal peptide cleaved 
form, while the doublet would represent the glycosylated and unglycosylated larger processed 

1 5 fomi. Since the epitope tag was placed at the caiboxy terminus of the protein, the identity of 
the 30 kDa peptide as the carboxy terminal portion of Shh is confirmed. Failure to detect the 
19 kDa species supports its identity as an amino terminal region of the protein. 

To test whether Shh is secreted by Xenopus oocytes, the medium in which the 
injected oocytes were incubated was probed by immunoprecipitation with the c-^myc 

20 antibody. A single band migrating slightly more slowly than the glycosylated larger 
processed form was observed. This protein is insensitive to Endo H. This result is expected 
since most secreted glycoproteins lose sensitivity to Endo H as they travel through the Golgi 
apparatus arid are modified by a series of glycosidases (Komfeld, R. and Komfeld, S., (1985) 
Annu, Rev. Biochem. 54:631-664). The enzymatic maturation of the Asn-linked carbohydrate 

25 moiety could also explain the slight decrease in mobility of tiiie secreted larger protein versus 
the intracellular material. Following Endo H digestion, a band with a slightly lower mobility 
than the signal peptide cleaved protein, is also apparent, suggesting that some Shh protein is 
secreted without undergoing proteolytic processing. Failure to detect this protein in the 
medium witiiout Endo H digestion suggests heterogeneity in the extent of carbohydmte 

30 modification in &e Golgi preventing the material from migrating as a distinct band. 
Resolution of this material into a single band following Endo H digestion suggests that the 
carbohydrate structure does not mature completely in die Golgi apparatus. Structural 
dififerences between the unprocessed protein and the larger processed fomi could account for 
this observation (Komfeld, R. and Komfeld, S., (19B5) Anmi Rev, Biochem. 54:631-664). 



35 
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Cos cells 

The behavior of mouse Shh in a mammalian cell type was investigated using 
transfected cos cells. Synthesis and secretion of flie protein was monitored by 
immunoprecipitation using the c-myc antibody. Transfected cos cells express the same Sonic 
5 hedgehog species that were detected in the mjected Xenopus oocytes, and their behavior 
following Endo H digestion is also identical. Furthermore, secretion of the 30 kDa 
glycosylated form is observed in cos cells, as well as the characteristic insensitivity to Eado 
H after secretion. Most of the secreted protein co-migrates with the intracellular, 
glycosylated lai^er processed form, but a small amount of protein with a slightly lower 
10 mobility is also detected inlfae medium. As in the Xenopus oocyte cultures, some 5/z/i which 
has not und^gone proteolytic processing is evident in the medium, but only after Endo H 
digestion. 

Baculovirus infected cells 

To examine the behavior of the mouse Shh protein in an invertebrate cell type, and to 
IS potentially purify Shh peptides, a recombinant baculovirus was constructed which placed the 
Shh coding sequence, with the caiboxy terminal tag, unda tiie control of the baculoviral 
Polyhedrin gene promoter. When insect cells were infected vdtfa the recombinant 
baculovurus, Shh peptides could be detected in cell lysates and medium by Western blotting 
with the c-myc antibody. 

20 The Shh products detected in this system were similar to those described above. 

However, virtually no unprocessed protein was seen in cell lysates, nor was any detected in 
the medium after Endo H digestion. This suggests that tiie proteolytic processing event 
occurs more ef&cientiy in these cells than in either of the other two cell types or the in vitro 
translation system. A doublet corresponding to the glycosylated and unglycosylated 30 kDa 

25 forms is detected, as well as the secreted, Endo I resistant peptide as seen in the other 
expression systems. Unlike the other systems, however, all of the secreted larger processed 
form appears to comigrate with the glycosylated intracellular material. 

(iv) Secretion of a Highly Conserved Amino Terminal Peptide 

To detemmie tiie behavior of the amino terminal portion of the processed Sonic 
30 hedgehog protein, the c-myc epitope tag was positioned 32 amino acids after the putative 
signal peptide cleavage site (Figure 12). Cos cells were transfected with Shh expression 
' constructs containing the c-myc tag at the carboxy terminus or near the amino terminus. 

When this construct was expressed in cos cells, both the full length protein and the smaller 
processed form (approximately 20 kDa due to addition of the c-myc tag) were detected by 
35 immunoprecipitation of extracts from labeled cells. However, the 20 kDa product is barely 
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detected in the medium. In cells transfected in parallel with the carboxy terminal c-myc 
tagged construct, the full length and 30 kDa products were both precipitated from cell lysates 
and medium as described earlier. 

As &e amino terminal c-myc tag may affect the secretion efficiency of the smaller 
processed form, the e^qiression of this protein was examined in cos cells using an antiserum 
directed against amino acids 44 through 143 of mouse SKh (Figure 12). After transfection 
with the caiboxy-terminal c-myc tagged constmct, immunoprecipitation with the anti-S%/t 
sertm detected a very low level of the smaller processed form in the medium despite a strong 
signal in &e cell lysate. This recq)itulates the results with the myc antibody. 

To examine the subcellular localization of Shh proteins, cos cells were transfected 
with the carboxy terminal tagged Shh construct and plated on multi-chamber slides, fixed and 
penneabilized. The cells were incubated simultaneously with the anti-iS%A serum and the c- 
myc antibody followed by FTTC conjugated Goat anti-Rabbit-IgG and RTTC conjugated Goat 
anti-Mouse-IgG. DAPI was mcluded to stain nuclei. Strong perinuclear staining 
characteristic of the Golgi apparatus was observed with the znH&rShh serum. The same 
subcellular region was also stained using the c-myc antibody. The comcidence of staining 
patterns seen with the two antibody preparations suggest that the low level of the smaller 
processed fonn detected in the medium is not due to its retention in the endoplasmic 
reticulum. 

(v) Hedgehog Processing 

In summary, the results discussed above demonstrate that the mouse and chick Shh 
genes encode secreted glycoprotems vMch undergo additional proteolytic processing. Data 
indicate that this processing occurs in an apparently similar fashion in a variety of cell types 
suggesting that it is a general feature of the Shh protein, and not unique to any particular 
expression system. For mouse Shh^ data indicate that both products of this pioteolj^c 
processing are secreted. These observations are summarized in Figure 13. 

It was observed that the 19 kDa amino peptide accumulates to a lower level in the 
medium than the 27 kDa carboxyl peptide. This may reflect inefficient secretion or rapid 
turnover of diis species once secreted. Alternatively, the smaller form may associate with the 
cell sui&ce or extracellular matrix components making it difficult to detect in the mediimi. 
The insensitivity of the secreted, larger form to Endo H is a common feature of secreted 
glycoproteins. During transit tiuough the Golgi q)paratus, the Asn-linked carbohydrate 
moiety is modified by a series of specific glycosidases (reviewed in Komfeld, R. and 
Komfeld, S., (1985) Armu. Rev. Biochem 54:631-664; Tarentino, A.L, et al., (1989) Methods 
Cell Biol 32:111-139). These modifications convert the structure fix)m the immature "high 
mannose" to the mature "complex" type. At one step in this process, a Golgi enzyme, a- 
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mannosidase II, removes two mannose residues from the complex rendering it insensitive to 
Endo H (Komfeld, R. and Komfeld, S., (1985) Rev, Biochem 54:631-664). 

Based on the observed molecular masses of the processed forms of mouse and chick 
Shh, the predicted secondary proteolytic cleavage site would be located near the border of the 
5 sequences encoded by the second and thbd mons. Interestingly, this region marks the end of 
the most highly related part of the hedgehog proteins. The amino terminal (smaller) form 
would contain the most highly conserved pordon of the protein. In &ct, the amino acids 
encoded by exoxis one and two (exclusive of sequences iq>stream of the putative signal 
peptide cleavage sites) share 69% identity between Dros-HH and mouse Shh, and 99% 

10 identity between chick and mouse Shh. Amino acid identity in the region encoded by the 
third exon is much lower 30% mouse to Drosophila and 71% mouse to chick (Echelard, Y. et 
al., (1993) Cell 75:1417-1430). Therefore, the two processed forms of Shh may have 
conserved as well as divergent signaling activities separated into distinct coding exons in the 
Shh gene. Furthermore, the observation that some unprocessed protein is secreted by 

15 Xenopus oocytes and cos cells raises the possibility that it may have a separate fimcdon. 

The biochemical behavior of mouse Shh sqppears to be quite similar to that described 
for the Drosophila Hedgehog (Dros-HH) protein (Lee, Ji. et al., (1992) Cell 71:33-50; 
Tabata, T. et al., (1992) Genes & Dev. 6:2635-2645). In vitro translation of Dros-HH 
mRNA, in the presence of microsomes, revealed products witii molecular masses 

20 corresponding to full length protein, as well as to the product expected after cleavage of the 
predicted mtemal (Type II) signal peptide (Lee, J,L. et al., (1992) Cell 71:33-50). 
Interestingly, no additional, processed forms were observed. However, such forms could 
have been obscured by breakdown products migrating between 20 and 30 kDa. When an 
RNA encoding a form of the protein lacking the carboxy-terminal 61 amino acids was 

25 translated, no breakdown products were seen, but there is still no evidence of the proteolytic 
processing observed with mouse ShK A similar phenomenon has been observed in these 
experiments. A reduction in the extent of proteolytic processing is seen when a mouse Shh 
protein lacking 10 carboxy-terminal amino acids is translated in vitro or expressed in cos 
cells (data not shown). This suggests that sequences at the carboxy temuni of Dros-HH 

30 proteins act at a distance to influence the efSciency of processing. 

In vivOy processing of Dros-HH has been demonstrated (Tabata, T. et al., (1992) 
Genes & Dev, 6:2635-2645). Inmiunoblots of lysates from Schneider cells transfeoted with a 
hh expression vector reveal two smaller molecular weight forms similar to those described 
for mouse Shh. These products were also detected in extracts of larvae and imaginal discs 
35 derived from flies e?q3ressing a heat shock mducible hh construct. Thus, it is clear diat there 
are also several distinct forms of Dros-HH proteins. 
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(vi) Hedgehog Signaling 

In order to satisfy the criteria for intercellular signaling, hedgehog protons must be 
detected outside of their domains of expression. This has been clearly demonstrated for 
Dros-HH. Using an antiserum raised against nearly full length Dros-HH protem, Tabata and 

5 Romberg (Tabata, T. and Komberg, T.B.. (1992) Cell 76:89-102) detect the protein in stripes 
that are slightly wider than the RNA ejqjression domains in embryonic segments, and just 
anterior to the border of the RNA expression domain in wing imaginal discs. Similarly, 
Taylor, et al., (1993) Meek Dev. 42:89-96, detected Drois-HH protdn in discrete patches 
withm cells adjacent to those expressing hh RNA in embryonic segments using an antiserum 

10 directed against an amino-terminal portion of Dros-HH which, based on flie proteolytic 
processing data (Tabata, T. et al., (1992) Genes & Dev. 6:2635-2645), is not likely to 
recognize the carboxyl cleavage product 

The detection of Dros-HH beyond cells expressing the hh gene is consistent with 
the phenotype of hh mutants. In these animals, ceUular patterning in each embryonic 

15 parasegment in disrupted resulting in an abnormal cuticular pattern remkdscent of that seen 
mwg mutants. Further analysis has revealed that the loss of hh gene function leads to loss of 
wg expression in athin stripe of cells just antoior tol3ashh expression domain (Ingham, P.W. 
and Hidalgo, A., (1993) Development 117:283-291). This suggests that Dros-HH acts to 
maintam wg expression in neighboring ceUs. The observation that ubiquitously expressed 

20 Dros-HH leads to ectqnc activation of wg supports this model (Tabata, T. and Komberg, 
T.B., (1992) Cell 76:89-102). In addition to these genetic studies, there is also indirect 
evidence that Dros-HH acts at a distance from its site of expression to influence patteriiing of 
the epidermis (Heemskerk, J. and DiNaido, S., (1994) Cell 76:449-460). 

The apparent effect of Dros-HH on neighboring cells, as well as on those located at 
25 a distance from the site of hh expression is reminiscent of the influence of the notochord and 
floor plate on the developing vertebrate CNS, and of the ZPA in the limb. The notochord (a 
site of high level Shh expression) induces the formation of the floor plate in a contact 
dependent manner, while the notochord and floor plate (another area of sttong Shh 
expression) are boA capable of mducing motomeurons at a distance (Placzek, M. et al., 
30 (1993) Development 1 17:205-21 8; Yamada, T. et al., (1993) Cell 73:673-686). 

Moreover ZPA activity is required not only for patterning cells in tiie extreme 
posterior of the limb bud where Shh is transcribed, but also a few hundred microns anterior of 
this zone. Several lines of evidence indicate that Shh is able to induce floor plate (Echelard, 
Y. et al., (1993) Cell 75:1417-1430; Roelink, H. et al., (1994) Cell 76:761-775) and mediate 
35 the signaling activity of the ZPA (Riddle, R.D. et al., (1993) Cell 75:1401-1416). Since it 
has been shown that Shh is cleaved, it can be speculated that the processed peptides may have 
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distinct activities. The smaller amino terminal form, which appears to be more poorly 
secreted, less stable or retained at the cell surface or in the extracellular matrix, may act 
locally. In contrast, the larger carboxy terminal peptide could possibly function at a distance. 
In this way, Shh peptides may mediate distinct signaling functions in the vertebrate embryo. 

5 

Example 7 

Sonic hedgehog emd Fgf'4 act through a signaling cascade and feedback loop to integrate 
growth and patterning of the developing limb bud 

(i) Experimental Procedures 
10 Cloning ofChickenFgf'4 and Bmp'2 

A 246 bp jBragment of the chicken Fgf'4 gene was cloned by PGR from a stage 22 
chicken limb bud library. Degenerate primers were designed against previously cloned Fgf'4 
and Fgf'6 genes: fgf5' (sense) AAA AGC TTT AYT GYT AYG TIG GIA THG G (SEQ ED 

15 No:38) and fgG' (antisense) AAG AAT TCT AIG CRT TRT ART TRT TIG G (SEQ ID 
No:39). Denaturation was at 94^C for 2 min, followed by 30 cycles of 94^C for 30 sec, 50**C 
for 60 sec, and 72**C for 30 sec, with a final extension at 72X for 5 min. The PGR product 
was subcloned into the Bluescript SK+ vector. A clone was sequenced and confirmed as Fgf 
' 4 by comparison with previously published Fgf-4 genes and a chicken F^'4 gene sequaice 

20 kindly provided by Lee Niswander. 

BMP-related sequences were amplified from a stage 22 posterior limb bud cDNA 
library prepared in Bluescript using primers and conditions as described by Basler, et al. 
(1993). Amplified DNAs were cloned and used to screen a stage 22 limb bud libnuy 
prepared in X.-Zap (Stratagene). Among the cDNAs isolated was chicken j5m/7-2. Its identity 

25 was confirmed by sequence comparison to the published clones (Francis, et al., (1994) 
Development 120:209-218) and by its e3q)ression patterns in chick embryos. 

Chick Surgeries and Recombinant Retroviruses 

30 All experimental manipulations were performed on White Leghorn chick embryos (S- 

SPF) provided by SPAFAS (Norwich, Conn). Eggs were staged according to Hamburger and 
Hamilton (1951) J. Exp Morph. 88:49-92. 

Viral supematants of 5o«/c/RCAS-A2 or a variant containing an influenza 
hemaglutinin epitope tag at the carboxyl terminus of the hedgehog protein {Sonicl. 1/RCAS- 

35 A2, functionally indistinguishable from iSow/c/RCAS-A2), were prepared as described 
(Hughes, et al., (1987) J. Virol 61:3004-13; Fekete and Cepko, (1993) Mol & Cell Biol 
13:2604-13; Riddle, et al., (1993) Cell 75:1401-16). For focal injections the right wings of 
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Stage 18-21 embryos were transiently stained with nile blue sulfate (0.01 mg/ml in Ringer's 
solution) to reveal the AER. A trace amount of concentrated viral siqiematant was injected 
beneath the AER. 

The AER was removed using electrolytically sharpened tungsten wire needles. Some 
embryos had a hq)arin-acrylic bead soaked in FGF-4 solution (0.8 mg/ml; a gift ftom 
Genetics Institute) or PBS stapled to the limb bud with a piece of 0.025mm platinum wire 
(Goodfellow, Cambridge UK) essentially as described by Niswander et al, (1993) Ce// 
75:579-87. 

Limbs -wbich were infected with Sonic/RCAS virus after AER removal were infected 
over a large portion of the denuded mesodom to ensure substantial infection. Those embryos 
which received both an Fgf-4 soaked bead and virus were infected only tmdemeadi the bead. 

In Situ Hybridizations and Photography 

Single color -whsAt mount in situ hybridizations were performed as described (Riddle, 
et al., (1993) Cell 75:1401-16). Two color whole mount in situ hybridizations were 
perfonned essentially as described by Jowett and L^ce (1994) Trends Genet. 10:73-74. 
The second color d^ection was developed using 0.125mg/ml magenta-phos (Bipsynth) as the 
substrate. Radioactive in situ hybridizations on 5^m sections was paformed essentially as 
described by Tessarollo, et al. (1992) Development 1 1 5:1 1-20. 

The following probes were used for vAnale mount and section in situ hybridizations: 
Sonic. 1.7kb fiagment of pHH2 (Riddle, et al., (1993) Cell 75:1401-16). Bnq>-2: 1.5 kb 
frs^ent encoding tiie entire open reading frame. Fgf-A: 250 bp fiagment described above. 
Hox d-lJ: a 600 bp fiagment, Hmcd-13: 400 bp fiagment both iiicluding 5' untranslated 
sequences and coding sequences iq)sb«am of the homeobox. RCAS: 900 bp Sall-Clal 
fiagment of RCAS (Hughes et al., (1987) J rirol. 61:3004-12). 

(ti) Relationship of Sonic to &uhgenous Bmp-2 andHoxd Gene Rqtression 

The best candidates for genes regulated by Sonic in vivo are tiie distal memba-s of the 
Hoxd gene cluster, Hoxd-9 through -13, and Bmp-2. Therefore, the relationships of the 
expression domains of these genes in a staged series of normal chick embryos were analyzed. 
Hoxd-9 and Hoxd- JO are expressed throughout the presumptive wing field at stage 16 
(Hamburger and Hamilton, (1951) J. Exp. Morph. 88:49-92), prior to the first detectable 
expression of Sonic at early stage 18. Hoxd-11 expression is first detectable at early stage 18, 
the same time as Sonic, in a domain coextensive with Sonic. Expression of Hoxd- 12 and 
Hoxd-13 commence shortly thereafter. These results suggest that Sonic might normally 
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induce, directly or indirectly, the expression of only the latter three members of the cluster, 

even though all five are nested within the early limb bud. 

As limb outgrowth proceeds Sonic expression remains at the posterior margin of the 

bud. In contrast the Hoxd gene expression domains, which are initially nested posteriorly 
5 around the Sonic domain, are very dynamic and lose their concentric character. By stage 23 

the Hoxd'll domain extends anteriorly and distally far beyond that of Sonic^ while Hoxd- J 3 

expression becomes biased distally and displaced from Sonic, 

While it is not clear whether Bmp-2 is expressed before Sonic (see Francis et al,, 

(1994) Development 120:209-218) Bmp-2 is expressed in a mesodennal domain which 
10 apparently overlaps and surrounds that of Sonic at the earhest stages of Sonic expression. As 

the limb bud develops, the mesodermal expression of 5wp-2 remains near the posterior limb 

margin, centered around that of Sonic, but in a larger domain than Sonic. This 

correspondence between Sonic and Bmp'2 expression lasts until around stage 25, much 

longer than the correspondence between Sonic and Hoxd gene expression. After stage 25 
15 Bmp-2 expression shifts distally and is no longer centered on Sonic, 

(Hi) Relationship of Sonic to Induced Bmp^2 and Hoxd Gene Expression 

The fact that the egression domains of the Hoxd genes diverge over time fix)m that of 
20 Sonic hedgehog implies that Sonic does not directly regulate their later patterns of expression. 
This does ^ot preclude the possibility that the later expression donudns are gmetically 
downstream of Sonic. If this were the case, exogenously e?q)ressed Sonic would be expected 
to initiate a program of Hoxd gene expression which recapitulates that seen endogenously. 
Therefore, the spatial distribution of Hoxd gene expression at various times following Sonic 
25 misexpression was compared. The anterior marginal mesoderm of early bud (Stage 18*20) 
wings was injected at a single point under the AER wi& a replication competent virus ihat 
expresses a chicken Sonic cDNA. Ectopic Sonic expressed by this protocol leads to both 
anterior mesodermal outgrowth and anterior extension of the APR. 

The Sonic and Hoxd gene expression domains in the infected limbs were analyzed in 
30 sectioned and intact embryos. Viral Sonic message is first detected approximately 18 hours 
after infection at the anterior margin, at the same time as, and approximately coextensively 
with, induced Hoxd-ll. This suggests that Sonic can rapidly induce Hoxd-ll egression and 
that the lag after injection represents the time required to achieve Sonic expression. By 35 
hours post infection distal outgrowth of infected cells combined with lateral viral spread 
- 35 within the proliferating cells leads to viral expression in a wedge which is broadest at the 
distal margin and tapers proximally. By this time, Hoxd-ll expression has expanded both 
antero-proximally and distally with respect to the wedge of 5ow/c-expressing cells, into a 
domain which appears to mirror the more distal aspects of the endogenous Hoxd-ll domain. 
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Weak Hoxd-13 expression is also detected at 35 hours in a subset of the Sonic expressing 
domain at its distal margin. 51 hours after infection the relationship of Sonic and Hoxd-1 J 
expression is similar to that seen at 35 hours, while the induced Hoxd-JS expression has 
reached wild type levels restricted to the distal portions of the ectopic growth. Thus the 
ectopic Hoxd expression domains better reflect the endogenous patterns of expression than 
they do the region expressing Sonic. This suggests that there are multiple fectors regulating 
ifooof ejqwession but their actions lie downstream of 5omc. 

Since the endogenous Bmp-2 expression domain correlates well with that of Sonic, 
and Bmp-2 is induced by ZPA grafts, it was looked to see if Bmp-2 is also induced by Sonic. 
Bmp-2 is normally expressed in two places in the early limb bud, in the posterior mesoderm 
and throughout the AER (Francis, et al., (1994) Development 120:209-218). In mjected limb 
buds additional Bmp-2 expression is seen in both the anterior mesodenn and in the anteriorly 
extfflided AER. The domain of Bmp-2 expression is slightly more restricted than that of viral 
expression, suggesting a delay in Bmp-2 induction. Bi7^2 expression in both the mesoderm 
and ectodenn is thus a downstream target of Sonic activity in the mesoderm. In contrast to 
the expression domains of the Hoxd genes, the endogenous and ectopic Bmp-2 expression 
domains correlate weU with that of Sonic. This suggests that Bmp-2 expression is regulated 
more directly by Sonic than is expression of the Hoxd genes. 

(iv) The AER and Conq>etence to tlnspond to Sonic 

Ectopic activation of Hoxd gene expression is biased distally in virally infected 
regions, suggesting that ectodermal fectors, possibly from the AER, are required for Hoxd 
gene induction by Sonic. To test this. Sonic virus was injected into the proximal, medial 
mesoderm of stage 21 limb buds, presumably beyond the mfluence of the AER. Although 
the level of Sonic expression was comparable to that observed in distal injections, proximal 
misexpression of Sonic did not result in ectopic induction of the Hoxd genes or Btig}-2, nor 
did it result in any obvious morphological effect (data not shown). The lack of gene 
induction following proximal inisexpreSsion of Sonic suggests that exposure to Sonic alone is 
insufficient to induce expression of these genes. 

This was tested more rigorously by injection of Sonic virus into the anterior marginal 
mesoderm of stage 20/21 limb buds after the anterior half of the AER had been surgically 
removed. Embryos were allowed to develop for a fiirther 36 to 48 hours before harvesting. 
During this time the AER remaining on the posterior half of the limb bud promotes ahnost 
wild type outgrowth and patterning of the bud. Gene expression was monitored both in 
sectioned and intact embryos. In the presence of the AFR, Sonic induces both anterior 
mesodermal proliferation and expression of Hoxd-1 1, Hoxd-13 and Bmp-2. In the absence of 
the overlying AER, Sonic does not induce either mesodermal proliferation or expression of 
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these genes above background. Signals from the AER are thus required to allow both the 
proliferative and patterning effects of Sonic on the mesoderm. 

Since application of FGF protein can rescue other functions of the AER such as 
promoting PD outgrowth and patterning, it was sought to determine whether FGFs might also 
5 promote mesodermal competence to respond to Sonic. FGF-4-soaked beads were stapled to 
AER-denuded anterior mesoderm which was infected with Sonic virus. Gene expression and 
mesodermal outgrowth were monitored as described previously. In the presence of both 
Sonic virus and FGF-4 protein, Hoxd-ll, Hoxd-13 and Bmp-2 e)q)ression are all induced. 
The expression levels of the induced genes are similar to or greater than the endogenous 

10 expression levels, and are equivalent in magnitude to their induction in the presence of the 
AER. Thus Fgf4 can induce the competence of the mesoderm to respond to Sonic. 

Sonic alone is insufiScient to induce either gene expression or mesodermal 
proliferation in the absence of the AER, while the combination of Sonic and FGF-4 induces 
both proliferation and gene e)q)ression. It was than asked whether FGF-4 alone has any 

15 effect on gene induction or mesodermal proliferation. Application of FGF-4 in the absence 
of Sonic virus does not induce Hoxd or Bmp-2 gene expression above control levels, however 
FGF-4 alone induces mesodermal outgrowth. These results suggest that mesodermal gene 
activation requires direct action of Sonic on the mesoderm and that proliferative response to 
Sonic is indirect, due to the induction of FGFs. 

20 

(v) Sonic Induces PolcaizedFgf4 Expression in the AER 

Fgf-4 is expressed in a graded &shion in the AER of the mouse limb bud, ivith 
maximal expression at the posterior region of the AER tapering to undetectable levels in the 

25 anterior ridge (Niswander and Martin, (1992) Development 114:755-68). Therefore, it was 
^propriate to investigate whether Fgf'4 is asymmetrically expressed in the chick AER, and 
whether its expression is induced by Sonic. A fragment of the chicken Fgf^ gene was cloned 
from a stage 22 chicken limb library by PGR using degenerate primers designed from mouse 
Fgf-4 and Xenopiis e-Fgf sequence; based on information provided by L. Niswander and G. 

30 Martin. Assignment of gene identity was based on primary sequence as well as comparison 
of expression patterns with that of murine Fgf'4 (Niswander and Martin, (1992) Development 
1 14:755-68). Whole mount in situ hybridization analysis showed strong limb expression of 
chick Fgf'4 in the AER. Fgf-4, like Bmp'2, is expressed all the way to the posterior border of 
the AER, but its anterior domain ends before the morphological end of the AER creating a 
' 35 posterior bias that has also been observed by Niswander et al,, (1994) Nature (in press). 
Expression is first detected in the distal AER at about stage 18. As outgrowth proceeds the 
posterior bias develops. Expression peaks around stage 24/25 and then fades by stage 28/29. 
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The expression domain of Fgf-4 becomes posteriorly biased as Sonic is expressed in 
the posterior mesoderm. This observation is consistent with Sonic influencing the expression 
of Fgf-4 in the posterior AER. To test the effect of Sonic on Fgf-4 ejqnession in the AER, 
stage 1 8-20 embryos were infected with Sonic virus in a single point at their anterior margin 
beyond the anterior limit of the AER. The embryos were harvested one to two days later, 
vidien an extension of the anterior AER became apparent. The expression of Fgf-4 was 
analyzed by in situ hybridization . Fgf-4 expression is induced in the anteriormost segment 
of the AER, in a region which is discontinuous with the endogenous expression domain, and 
overlies the domain of viral Sonic infection. This result contrasts with the Bnqh2 ejqitession 
mduced m the extended AER, which is always continuous with the endogenous e3q)ression 
domain. The asymmetry of the induced Fgf-4 expression indicates that Sonic polarizes the 
extended AER, much as a ZPA graft does (Maccabe and Parker, (1979) J. Embryol Exp. 
Morph 53:67-73). Since FGFs by themselves are mitogenic for limb mesoderm, these 
results are most consistent with Sonic inducing distal proliferation indirectly, through the 
induction of mitogens in the ovorlying AER. 



(vi) Reciprocal Regulation of Sonic by Fgf-4 

Sonic thus appears to be upstream of Fgf-4 expression in the AER. However, since 
the AER is requned to maintain polarizing activity m the posterior mesoderm (Vogel and 
Tickle, (1993) Development 19:199-206; Niswander et al., (1993) Cell 75:579-87), Sonic 
may also be downstream of the AER. If Sonic is regulated by the AER and the AER by 
Sonic, this would imply tiiat they are reinforcing one another through a positive feedbadc 
loop. 

To test whether Ae AER dependence of ZPA activity is controlled at the level of 
transcription of the Sonic gene. Sonic e}q>ression following removal of the AER from the 
posterior half of the limb bud was assayed. Sonic expression is reduced in an operated limb 
compared to the contralateral control limb within ten hours of AER removal, indicating that 
Sonic expression is indeed AER dependent. Tlie dependence of Sonic expression on signals 
from the AER suggests that one of the fimctions of the AER is to constrain Sonic expression 
to the more distal regions of the posterior mesoderm. 

In addition to their mitogenic and competence-inducing properties, FGFs can also 
substitute for the AER to maintain the ZPA. In order to test whether FGFs can support the 
expression of Sonic, beads soaked in FGF-4 protein were stapled to the posterior-distal tips of 
limb buds after posterior AER removal. Embryos were assayed for Sonic expression 
approximately 24 hours later, when Sonic expression is greatly reduced in operated limb buds 
which had not received an FGF-4 bead. Strong Sonic expression is detectable in the posterior 
mesoderm, sUghtly proximal to the bead implant, and reflecting the normal domain of Sonic 
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ejq)ression seen in the contralateral limb. With the finding that FGF-4 can maintain Sonic 
expression, the elements required for a positive feedback loop between Sonic expression in 
the posterior mesoderm and Fgf^ ejqjression in the posterior AER are established (see also 
Niswander et al. (1994) Nature (in press)). 

The induction of Bmp-2 expression by Sonic requires signals from the AER, and its 
domain correlates over time with that of Sonic. Therefore, it was interesting to leam if the 
continued expression of Bmp'2 also requires signals from the AER, and if so, viiether they 
could be replaced by FGF-4. To test this, Bmp-2 expression following posterior AER 
removal, and following its substitution with an FGF-4 bead was assayed. Bmp-2 e3q)ression 
fades within hours of AER removal, and can be rescued by FGF-4. These data indicate that 
the maintenance of Bmp-2 expression in the posterior mesodom, like that of Sonic, is 
dependent on signals from the AER, v^ch are likely to be FGFs. 

(vii^ The Mesodermal Response to Sonic 

It has been found that only mesoderm underlying the AER is responsive to Sonic, 
apparently because the AER is required to provide competence signals to the limb mesoderm. 
Fgf^, w^ch is repressed in the AER, can substitute for the AER in this regard, and thus 
might act in combination with Sonic to promote Hoxd and Bmp-2 gene ejqjression in the 
mesoderm. FGFs may be permissive factors in a number of instructive pathways, as they are 
also required for activins to pattern Xenopus axial mesoderm (Cornell and Kimelman, (1994) 
Development 120:2187-2198; LaBonne and Whitman, (1994) Development 120:463-472). 

The induction of Hoxd mi Bmp-2 e^qxession m response to Sonic and FGF-4 in the 
absence of an AER suggests that ihe mesoderm is a direct tai^et tissue of Sonic protein. 
Since Sonic can induce Fgf-4 exptcs^m in the AER, it follows fbA Sonic also acts indirectly 
on the mesoderm tiirough die induction of competence &ctors in the arr. 

(viii) Downstream Targets and a Cascade of Signals Induced by Sonic 

The five AbdB-like Hoxd genes, Hoxd-9 through -75, are initially expressed in a 
nested pattem centered on ibe posterior of the limb bud, a pattern wfaidi suggests they mi^t 
be controlled by a common mechanism (DoUe, et al., (1989) Cell 75:431-441; Izpisua- 
Belmonte, et al., (1991) Nature 350:585-9). The analysis of the endogenous and induced 
domains of Hoxd gene esqnession suggests that Sonic normally induces e}q>ression of Hmd- 
11, -12 and -13. In contrast it was found that Hoxd-9 and -10 repression initiate before Sordc 
mRNA is detectable. This implies that at least two distinct mechanisms control the initiation 
of Hoxd gene expression in the wing bud, only one of v^ch is dependent on Sonic. 

Several observations suggest that the elaboration of the Hoxd expression domains is 
not controlled directly by Some, but rather by sigiuds which are downstream of Sordc. The 
Hoxd expression domains rapidly diverge from Sonic, and evolve into several distinct 
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subdomains. Moreover these subdomains appear to be separately regulated, as analysis of the 
murine Hoxd-11 gene promoter suggests that it contains independent posterior and distal 
elements (Gerard, et al., (1993) Embo. J. 12:3539-50). In addition, although initiation of 
Hoxd-11 through -13 gene expression is dependent on the AER, their expression is 
5 maintained following AER removal (Izpisua-Behnonte, et al., (1992) Embo. J. 11:1451-7). 
As Sonic expression fades rs^idly under similar conditions, this implies that maintenance of 
Hoxd gene expression is independent of Sonic. Since ectopic Sonic can induce a 
recapitulation of the Hoxd expression domains in the limb, it can be concluded that although 
indirect efifectors appear to regulate the proper patterning of the Hoxd expression domains, 

10 they are downstream of Sonic. Potential mediators of these indirect effects include Bng>-2 in 
the mesoderm and Fgf-4 from the AER, 

In contrast to the Hoxd genes, Bmp-2 gene expression in the posterior limb mesoderm 
spears to be continually regulated by Sonic. It was found that both endogenous and ectopic 
Bmp-2 e;q)ression correspond to that of Sonic. Furthermore, continued Bmp'2 expression is 

15 dqsendent on the AER and can be rescued by FGF-4. It is likely that this is an indirect 
consequence of the &ct that Sonic expression is also maintained by the AER and can be 
rescued by FGF-4. In feet, Bmp-2 expression might be a direct response of cells to secreted 
Sonic protein. The differences between Bnq)-2 and Hoxd gene expression suggest that 
multiple pathways downstream of Sonic regulate gene expression m the mesoderm. 

20 Bny>-2 itself is a candidate for a secondary signaling molecule in the cascade of 

patterning ^ents mduced by Sonic. Bnqh2 is a secreted molecule of the TGF-p femily and 
its oqnession can be induced by Sonic. This iQ>pears to be an evolutionarily conserved 
pathway, as Dros-HH, the Dtosophila homolog of Sonic, activates the esqaession of t^p, the 
homolog of Bnqh2y in the eye and wing imaginal discs (Heberlein, et al., (1993) Cell 75:913- 

25 26; Ma. et al., (1993) CeU 75:927-38; Tabata and Komberg, (1994) Cell 76:89-102). 
Eiqpression of Dros-HH is normally confined to the posterior of the wing disc. Ectopic 
oqnession of Dros-HH m the anterior of Ae disc results in ectopic expression of and 
ultimately in the duplication of wing structure with mirror image symmetry (Bassler and 
Stnihl, (1994) Nature 368:208-214). This effect is strikingly, parallel to the phenotypic 

30 results ofectopicexpresdonofS'omc in the chick limb. 

(ix) Regulation of Sonic Eiqjression 

Sonic expression is activated in the posterior of the limb bud very early during 
mesodermal outgrowth (Riddle et al., (1993) CeU 75:1401-16). The fectors which initiate 
35 this localized ej^jression are not yet identified but ectopic expression of Hoxb-8 at the 
anterior margin of the mouse limb bud results in the activation of a second domain of Sonic 
expression under the anterior AER (Charity el al., (1994) CeU 78:589-601). Since retinoic 
acid is known to be able to induce the e3q)ression of Hoxb-8 and other Hox genes in vitro 
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(Mavilio et al., (1988) Differentiation 21:13-19) it is possible that endogenous retinoic acid 
acts to make cells competent to express Sonic by inducing expression of upstream Hex genes, 
either in the very early limb bud or in the flank prior to the limb bud formation. 

Several lines of evidence suggest that once induced Sonic expression is dependent on 
signals from the posterior AER. Followmg its initiation in the posterior limb mesoderm, the 
Sonic expression domain moves distally as the limb bud grows out, always remaining 
subjacent to the AER. Similarly, Sonic expression can also be induced on the anterior margin 
of the limb bud by implantation of a retinoic acid bead, but the induced ectopic expression is 
limited to the mesoderm directly underlying the AER (Riddle, et al., (1993) Cell 75:1401-16). 
In addition, ZPA activity fedes rapidly following removal of the AER (Niswander, et al., 
(1993) Cell 75:579-87; Vogel and Tickle, (1993) Development 1 19:199-206), and ZPA grafts 
only function A^en placed in close proximity to the AER (Tabin, (1991) Cell 66:199-217; 
Tickle, (1991) Development Supp. 1:113-21). The observation that continued Sonic 
expression depends on signals from the posterior AER reveals the mechanism underlying 
these observations. 

The reliance of Sonic expression on AER-derived signals suggests an explanation for 
the distal shift in Sonic e>q»ession during limb development (Riddle et al., (1993) Cell 
75:1401-16). Signals from the AER also promote distal ou^wth of the mesodermal cells 
of the progress zone, which in turn results in the distal displacement of the AER. Hence, as 
maintenance of Sonic e>q)ression requires signals from the AER, its expx^ion domain will 
be sumlarly displaced. 

It was found that replacement of the AER with FGF'4 soaked beads results in the 
mamtenance of Sonic ejqjression. This result is consistent with the previous findings that 
ZPA activity can be maintained in vivo and in vitro by membere of the FGF femily 
(Anderson, et al., (1993) Development 117:1421-33; Niswander et al., (1993) Cell 75:1401- 
16 ; Vogel and Tickle, (1993) Development 1 19:199-206). Since Fgf^ is normally expressed 
in the postaior AER, these results suggest that Fgf^ is the signal from the ectoderm 
involved in maintaining Sonic expression. 

(x) Sonic and Regulation and Maintenance of the AER 

Sonic can induce anterior extensions of the AER which have an inverted polarity 
relative to the endogenous AER, This polarity is demonstrated by examining the expression 
of two markers in the AER. In normal limbs Bmp-2 is expressed throughout the AER, while 
Fgf4 is expressed in the posterior two thirds of the AER. In the extended AER resulting 
from ectopic Sonic expression, Bmp'2 is again found throughout the AER, while Fgf'4 
expression is biphasic, found at either end of the AER, overlying the anterior and posterior 
mesodermal domains expressing Sonic. These results are consistent with previous 
observations that antero-posterior polarity of the AER appears to be regulated by the 
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underlying mesoderm, and that ZPA gxafts lead to the induction of ectopic, polarized AER 
tissue (Maccabe and Parker, (1979) J, Embryol Exp. Morph. 53:67-73). Our results also 
suggest that the normal AP polarity of the AER is a reflection of endogenous Sonic 
expression. The induced AER is sufficient to promote complete PD outgrowth of the induced 
5 structures (Riddle et al., (1993) Cell 75:1401-16). Hence whatever factors are necessary to 

m^ Tntain the AER are also downstream of Sonic. ^ 

(xi) A Positive Feedback Loop Between Sonic and Fgf4 

10 The induction of Fgf^ expression by Sonic in the ectopic AER, and the maintenance 

of Sonic expression by FGF-4 suggest that Sonic and Fgf4 expression are normally sustained 
by a positive feedback loop. Such a feedback loop would allow the coordination of 
mesodermal outgrowth and patterning. This coordination is possible because Sonic patterns 
mesodermal tissue and regulates Fgf'4 expression, while FGF'4 protein induces mesodermal 

15 prolifemtion and maintains Sonic expression Moreover mesodermal tissue can only be 
patterned by Sonic in the context of a competence activity provided by F8f-4. Thus 
patterning is always coincident with proliferation. 

It remains possible that mogenously applied Fgf'4 might be mimicking the activity of 
a diffeient member of the FGF &mily. For example, Fgf-2 is expressed in the limb 

20 mesoderm and the AER (Savage et al., (1993) Development Dynamics 198:159-70) and has 
similar effects on lunb tissue as FgH (Niswander and Mardn, (1993) Nature 361:68-71; 
Niswander, et al., (1993) Cell 75:579-87; Riley, et al., (1993) Development 118:95-104; 
Fallon, et al., (1994) Science 264:104-7). 

25 (xii) Coordinated Regulation of Limb Outgrowth and Patterning 

Patterning and outgrowth of the developing limb are known to be regulated by two 
major signaling centers, the ZPA and AER. The identification of Sonic and FGFs as 
molecidar mediators of the activities of the ZPA and AER has allowed for dissociation of the 

30 activities of these signaling centers fi-om flieir regulation, and investigation of the signaling 
pathways through which they function. 

The results presented above suggest that the ability of cells to respond to Sonic 
protein is dependent on FGFs produced by the AER. It was also found that Sonic induces a 
cascade of secondary signals involved in regulating mesodermal gene expression patterns. In 

35 £iddition evidence was found for a positive feedback loop initiated by Sonic, which maintains 
expression of Sonic in the posterior mesoderai and Fgf'4 in the AER. The feedback loop 
described suggests a mechanism whereby outgrowth and patterning along the AP and PD 
axes of the limb can be coordinately regulated. 
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The results described above further suggest that Sonic acts as a short range signal 
which triggers a cascade of secondiary signals whose interplay detennines the resultant 
pattempf structures. The data suggest a number of inductive pathways that can be combined 
to generate a model (Figure 14) which describes how Sonic^ in coordination with the AER, 
acts to pattern mesodemial tissues along the anterior-posterior iimb axis, while 
simultaneously regulating proximal-distal ou^rowtfa. 

Following its induction. Sonic signals to both the limb ectodemi and mesodenn. 
Sonic imposes a distinct polarity on the forming AER, including the posteriorly biased 
expression of Fgf-A^ and the AER becomes dependent on continued Sonic expression. The 
mesoderm, as long as it is receiving permissive signals from the overlying ectodeim, 
responds to the Sonic signal by expressing secondary signaling molecules such as Bmp-2 and 
by activating Hoxd genes. Bmjh2 expression is directly dependent on continued Sonic 
expression, while the continued expression of the Hoxd genes, rapidly becomes Sonic. 
independent In a reciprocal &shion, maintenance of Sonic hedgehog expression in the 
posterior mesodemi becomes dependent on signals from the AER. Since the frictors 
expressed by the AER are not only required for the maintenance of Sonic e^q^ression and 
activity, but are also mitogenic, growth and patterning become inextricably linked. 
Coordination of limb development through interdependent signaling centers forces the AP 
and PD structures to be induced and pattemed in tandem. The pathways elucidated hmin 
thus provide a molecular framework for the controls governing limb patteming 

Example 8 

Sonic, and Hox Gene Expression Suggest a Conserved Pathway in Patterning the 

Vertebrate and Drosophila Gut 

(i) Experimental Procedure 

In Situ Hybridization and Photograph 

BMP probes were isolated using primers designed to amplify members of the TGF- 
and BMP femilies (Easier, K. et al., (1993) Cell 73:687-702, eight independent 120 bp BMP 
fragments were amplified from a stage 22 ducken posterior limb bud plasmid cDNA library. 
These fragments were pooled and used to screen an unamplified stage 22 limb bud lambda 
zap cDNA library constructed as in Riddle et al., (1993) Cell 75:1401-16 . Among the BMP 
related clones isolated were an approximately 1.9 kb cDNA clone corresponding to chicken 
BMP'2 and an approximately 1.5 kb cDNA clone corresponding to chicken BMP^. Botii 
clones contain the entire coding regions. The Sonic clone was obtained as described in 
Riddle et al, (1993) Cell 75:1401-16. Digoxigenin-UTP labeled RNA probes were 
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transcribed as per Riddle et al., (1993) Cell 75:1401-16. Briefly, harvested chick embryos 
were fixed overnight in 4% paraformaldehyde, washed in PBS then processed for whole 
mount in situ hybridization methods are per Riddle et al., (1993)Ce// 75:1401-16. Embryos 
were photographed fiom either ventral or dorsal surfaces under transmitted light using a 
5 Nikon zoom stereo microscope with Kodak Ektar 100 ASA film. Whole moimt in situ 
hybridization embryos and viscera were processed for sectioning as described m Riddle et al., 
(1993)Ce// 75:1401-16. 15-25 iim transverse sections were air dried and photographed with 
brightfield or numarski optics using a Zeiss Axiophot microscope and Kodak Ektar 25 ASA 
film. 

10 

Chick Embryos and Recombinant Retroviruses 

A retroviral vector engineered to express a fiill length cDNA of chicken Sonic, as in 
Riddle et al. (1993) Cell 75:1401-16, was injected unilaterally into stage 8-13 chicken 

15 embryos targeting the definitive mdoderm at the mid-embryo level. At this stage the CIP has 
not formed and neither Sonic nor BMP-4 are expressed in the region injected. Injections were 
performed on the ventral surface on embryos cultured with their ventral surface facing up 
(New, D.A.T. (1955) Embryol E)q>. Morph. 3:320-31. Embryos were harvested 18-28 hours 
after injection and prepared for whole mount in situ hybridization (see above description of in 

20 situ experiment), hybridized with Sonic or BJMP-4 digoxigenin labeled probes. 

In Situ Hybridization with Hox Genes 

Cloned cDNA of the chicken homologues of /faxflf-9,-10,-1 1,-13 ; 6-9, c-9,-10,-11; rf- 
25 9,-10,-1 l,-12^d -13 were used to transcribe digoxigenen-UTP labeled riboprobes for whole 
mount in situ hybridization. Domestic chick embryos were harvested into PBS and 
eviscerated. The visceral organ block was fixed in 4% parafonnaldehyde overnight and 
. processed for vAlole mount in situ hybridization. Methods and photographic technique as 
described above. 

30 

(ii) Expression of Sonic and BMP'4 in Stage 13 Chick Embryos Determined by Whole Mount 
In Situ Hybridization 

Chick gut morphogenesis begins at stage 8 (Bamberger and Hamilton, (1987) Nutr. 
35 6:14-23 with a ventral in-folding of the anterior definitive endoderm to form the anterior 
mtestinal portal (AIP) (Romanoff, A.L., (1960) The Avian Embryo, The Macmillan Co., NY. 
This lengthens posteriorly forming the foregut. A second wave of endodermal invagination is 
initiated posteriorly at stage 13, creating the caudal intestinal portal (CIP). The CIP extends 
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antenoriy forming the hindgut. Sonic expression, previously noted in the endodeim of the 
vertebrate gut (Riddle et al., (1993) Cell 75:1401-16; Echelard et al., (1993) CeU 75:1417- 
1430), is expressed early in a restricted pattern in the endodermal lips of the AIP and CIP. 

Sonic expression is detected in the endodermoftheAIP and CIP in pre gut closure stages. At 
5 later stages, stage 28 embryos. Sonic is expressed in the gut in all levels (fore-, mid-, and 
hind-gut) restricted to the endodenn. Sonic is known to be an important inductive signal in 
other regions of the embryo including the limb bud (Riddle et al., (1993) Cell 75:1401-16) 
and neural tube (Echelard et al., (1993) Cell 75:1417-1430; Kraus et al., (1994) Cell 75:1437- 
1444; Roelink et al., (1994) Cell Since primitive gut endoderm is known to 

10 cause gut-specific mesodermal differentiation vviien combined with non-gut mesenchyme 
(Haffen et al., (1987) Butr. 6:14-23), we speculated that Sonic might function as an inductive 
signal to the visceral mesoderm. A potential target gene for the action of Sonic was 
suggested by analogy to the Drosophila imaginal discs where Dros-HH, the homologue of 
vertebrate Sonic, activates the expression of the JGF-p related gene dpp in adjacent cells 

15 (Tabata abd Komberg, (1994) Cell 76:89-102; Heberlein et al., (1993) Cell 75:913-926; Ma 
et al., (1993) Cell 75:913-926; Basler et al., (1993) Cell 73:687-702). There are two 
vertebrate homologues of dpp, BMP-2 and BMP-^, The earliest detectable expression of 

5A£P-^ occurs simultaneously with the first observable expression of 5omc in the developing 
gut. BMP-4 is expressed in a domain abutting Sonic at the AIP and the CIP, but is restricted 
20 to the adjacent ventral mesoderm. ^MP-^ gut expression persists into later stage embryos, 
stage 33 embryos, in the visceral mesodenn only. The tissue restricted expression of both 
genes is maintained in all stages studied. BMP-2 is not expressed in the gut at the AIP or 
CIP, but is expressed in clusters of cells in the gut mesoderm in later stages, a pattern distinct 
&oiia Hat of BMP-4, 

25 

(Hi) Ectopic Expression of Sonic Induces Ectopic Expression ofBMP'4 in Mesodermal 
Tissues of the Developing Chick 

To test whether Sonic is capable of inducing BMP-4 in the mesodenn we an ectopic 
30 expression system previously used to study the lole of Sonic in limb development was 
utilized (Riddle et al., (1993) Cell 75:1401-16). A repUcation competent retrovirus 
engineered to express Sonic was injected unilaterally into the presunqitive endoderm and 
visceral mesoderm at mid-embryo positions in stage 8-13 chick embryos in vitro (New, 
D.A.T. (1955) Embryol Exp. Morph. 3:320-321). When embryos were examined by in situ 
' 35 hybridization 18-26 hours later, the normal wild type expression of Sonic is detected at the 
AIP, CIP, and in the midline (neural tube and notochord). Ectopic Sonic expression is 
present unilaterally on the left ventral surfece. Also, wild type Sonic expression is seen in the 
floor plate of the neural tube and notochord. Ectopic expression is seen unilaterally in the 
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visceral endoderai, its. underlying splanchnic mesoderm, and somatic mesoderm. BMP-4 
expression can be seen induced in the mesoderm at the site of injection, in addition to its 
normal expression in the mesoderm of the CIP, Wild type BMP'4 expression is seen in the 
most dorsal aspects of the neural tube and symmetrical lateral regions adjacent to the neural 
5 tube. Induced BMP-4 expression is present unilaterally in the splanchnic mesoderm at the 
site of Sonic viral injection, and not in the visceral endoderm. 

Since BMP-4 is, itself, a secreted protein, it could function as a secondary signal in an 
inductive cascade, similar to the signal cascades fiom Dros-HH to dpp in Drosophila 
imaginal discs (Tabata abd Romberg, (1994) Cell 76:89-102; Heberlein et al., (1993) Cell 
10 75:913-926; Ma et al„ (1993) Cell 75:913-926; Easier et al., (1993) Cell 73:687-702) and 
from Sonic to BMP-2 in the limb bud. In the gut, BMP''4 could act as a secondary signal 
either as part of a feedback loop to the endoderm or within the visceral mesoderm. This latter 
possibility is consistent with the finding that in mice homozygous for a deletion in the BMP-4 
gene, the ventral mesoderm fails to close. 

15 

(iv) Expression of Hox Genes in the Developing Chick Gut 

There is a striking parallel between the apparent role of Sonic as an endoderm-to- 
mesoderm signal in early vertebrate gut morphogenesis and that of its Drosophila 

20 homologue, Dros-HH. Dros-HH ( like Sonic) is e?q)ressed in the Drosophila gut endoderm 
from the earliest stages of morphogenesis (Taylor et al., (1993) Meek Dev. 42:89-96). Its 
putative receptor, patched, is found in the visceral mesoderm implicating Dros-HH (like 
Sonic) in endodmnal-mesodermal inductive interactions. This led to consideradon whether 
otiier genes known to be involved in regulating Drosophila gut development might also play 

25 a role in regulating chick gut morphogenesis. Regionally specific pattern in Drosophila gut 
endoderm is regulated by a pathway involving restricted expression of homeotic genes in the 
mesoderm (McGixmis and Krumlauf, (1992) Cell 68:283-302). Although the basis for 
patterning the vertebrate gut is poorly understood, in several other regions of the embryo Hox 
genes have been implicated as key regulators of patterns. Vertebrate Hox genes are expressed 

30 in overlapping anteroposterior domains which correlate with structural boimdaries in the 
developing hindbrain, vertebrae, and limbs (McGirmis and Krumlauf, (1992) Cell 68:283- 
302). Whole mount in situ hybridization was used to test whether these genes are also 
expressed in the developing vertebrate hindgut and whether their domains of expression 
correlate with morphologic borders of the chick gut. 

35 Lumenal gut differentiation creates three morphologically and physiologically distinct 

regions: fore-, mid-, and hind- gut. The fore-gut and hind-gut are the derivatives of the 
primitive gut tubes initiated at the AIP and CIP respectively. Ultimately these tubes meet and 
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fiise at the yolk stalk around stage 24-28. The midgut is formed from both foregut and 
hindgut primordia, just anterior and posterior to the yolk stalk* 

The most posterior derivative of the hindgut is the cloaca, the common gut-urogenital 
opening. The rest of the hindgut develops into the large intestine. The midgut/hindgut 
S border is demarcated by a paired tubal structure, the ceca (analogous to the mammalian 
appendix), which fomis as budding expansions at the midgut/hindgut border at stage 19*20. 
Anterior to the ceca, the midgut forms the small intestine. 

The expression pattern of the 5' members of the Hox gene clusters in the chick 
hindgut by whole mount in situ hybridization was studied. Hox gene expression patterns in 
10 the gut are dynamic. They are initially expressed (by stage 10) in broad mesodennal domains 
extending anteriorly and laterally. Later they become restricted. By stage 25, the Abd-B like 
genes of the Hoxa and Hoxd cluster are regionally restricted in thek expression in hindgut 
mesoderm. The most anteriorly expressed gene, Hoxa-9, has an anterior border of expression 
within the mesodem of the distal midgut (to a point approximating the distal third of Ae 
15 midgut length). Each successive gene within the A and D Hox clusters has a more posterior 
domain of expression, /fojca-10, Hoxd-9 and Hoxd-lO are restricted in their expression to the 
ceca. Hoxa-U and Hoxd-ll have an anterior limit of expression in the mid<eca at the 
approxunate midgut/hindgut boundary (Romanoff, A.L. (1960) The Avian Embryo, The 
Macmillan Co. NY). Hoxd-\2 has an anterior limit at tiie posterior bolder of the ceca and 
20 extends posteriorly throughout &e hindgut to the cloaca. HoxaA'i and Hoxd-13 are 
expressed in the most posteriorly restricted domain, in the ventral mesoderm surrounding the 
cloaca. Hoxa-13 and HoxdAZ are the only Abd-B like genes which are also expressed within 
the gut endoderm, from the ceca to the cloaca. 

The only member of the B or C Hox clusters which we found to be e3q)ressed in the 
25 hindgut is /foxc-9. The e?q)ression of ffaxc-9 overkps with its paralogues /fox^ 

9 in the midgut mesodemi, but has a sharp posterior boundary, complementary to Hoxa-l 1 
and HoxdA 1 in tiie mid-ceca. 

The restricted expression of the Abd-B like Hox genes appear to demarcate fte 
successive regions of the gut which will form the cloaca, the large intestine, the ceca, the 
30 mid-ceca at the midgut/hindgut border, and the lower portion of the midgut (perh^s 
identifying that portion of the midgut derived from the posterior gut tube3). Moreover, these 
molecular events presage regional distinctions. Expression of all Hox genes could be 
detected by stage 14, well before the hindgut lumen is closed (by stage 28) and is maintained 
in subsequent stages studied. Cytodifferentiation of the hindgut mesoderm and epithelium 
* 35 begins later, at stages 29-31 (Romanoff, A.L. (1960) The Avian Embryo, The Macmillan Co. 
NY). 

These results suggest that specific Hox genes might be responsible for regulating 
morphogenesis of the gut. Consistent with this, there is an apparent homeotic alteration in 
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the gut of a transgenic mouse in which the anterior limit of expression of Hoxc-S is shifted 
rostrally : a portion of foregut epithelium mis-differentiates as midgut (Pollock and Bieberich, 

(1992) CeU 71:911-923). 

5 (v) Conservation in the Expression of Regulatory Genes Involved in the Formation of 
Vertebrate and Drosophila Gut 

There is an intriguing parallel between the expression patterns of Sonic, and 
the Hox genes in the vertebrate gut and those of their homologues during Drosophila gat 

10 morphogenesis (Figure 15). This conservation is of particular interest because in Drosophila 
the role played by these genes has been clarified genetically. Dros-HH (like its vertebrate 
homologue. Sonic) is e?q3ressed at the earliest stages in the gut endodenn and may be a signal 
to visceral mesoderm (Taylor et al., (1993) Mech Dev. 42:89-96). Nothing is known directly 
of the relationship between Dros-HH expression and activation of expression of other genes 

15 in the Drosophila gut. However, in Drosophila imaginal discs, Dros-HH is known to activate 
the expression of dpp in a signaling cascade (Kraus et al., (1994) Cell 75:1437-1444; 
Heberlein et al., (1993) Cell 75:913-926; Ma et al., (1993) Cell 75:913-926; Basler et al., 

(1993) Cell 73:687-702). Later in gut development, the production of dpp in the mesoderm 
contributes to the regulation of the expression of homeotic genes in both the mesoderm and 

20 the endoderm (Bienz, M. (1994) TIG 10:22-26). Drosophila homeotic genes are expressed in 
tiie gut visceral mesoderm and &eir expression is known to determine the morphologic 
borders of the midgut This involves proper induction of gene expression in the adjacent 
endoderm, one of the mediators of &e interaction is dpp (Bienz, M. (1994) TIG 10:22-26). If 
Dros-HH is requred for the ultimate activation of the homeotic genes in the Drosophila 

25 midgut, this would parallel the situation in the vertebrate limb bud where Sonic functions as 
an upstream activator of the Hox genes (Riddle et al., (1993) Cell 75:1401*1416), perhaps m 
a signaling cascade involving BMP^2, 

The extraordinary conservation in the expression of regulatory genes in the vertebrate 
and Drosophila gut strongly suggests a conservation of patterning mechanisms. Pathways 

30 established by genetic studies in Drosophila provide direct insights into the molecular basis 
for the regionalization and morphogenesis of the vertebrate gut. 



Example 9 

Bacterially Expressed Hedgehog Proteins Retain Motorneuron-inducing Activity 

35 Various fragments of the mouse Shh gene were cloned into the pETllD vector as 

fusion proteins with a poly(His) leader sequmce to facilitate purification. Briefly, fusion 
genes encoding the mature M-Shh protein (corresponding to Cys-25 through Ser-437 of SEQ 
ID No. 1 1) or N-teiminal containing fiagments, and an N-termmal exogenous leader having 
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the sequence M-G.S-S-H-H-H-H-H-K L-V..?-R.G-S.H.M (SEQ ID No: 47) were cloned in 
pETl ID and introduced into E. coli. The xny.^sJtix&yShh fusion proteins were purified using 
nickel chelate chromatography according to tne vendor's instructions (Qiagen catalog 30210), 
and the poly(His) leader cleaved from the purified proteins by treatment with thrombin. 

5 Preparations of the purified Shh proteins were added to tissue explants (neural tube) 

obtained from chicken embryos and cultured in a defined media (e.g., no serum). M-Shh 
protein was added to final concentrations of between 0.5pM to 5nM, and differentiation of 
the embryonic e)q>lant tissue to motomeuron phenotype was detected by expression of Islet-1 
antigen. The bacterially produced protein was demonstrated to be active in the explant 
10 cultures at concentrations as low as 5 to 50pM. An Shh polypeptide containing all 19kd of 
the amino terminal fi^gment and £Qyproximately 9kd of the carboxyl terminal fiagment (see 
Example 6) displayed both motor neuron inducing activity and weak floor plate inducing 
activity, indicating that ftese activities likely reside wiA the N-terminal fiiagment 

15 All of the above-cited references and publications are hereby incoipomted by 

reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine e3q)erunentation, numerous equivalents to the specific polypeptides, nucleic acids, 
20 mefliods, assays and reagents described herein. Such equivalents are considered to be within 
the scope of this invention and are covered by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: President and Fellows of Harvard College 

(B) STREET: 124 Mt. Aubum Street 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 02138 

(A) NAME: Imperial Cancer Research Technology Ltd. 

(B) STREET: Sardinia Street 

(C) CITY: London 

(D) STATE: N/A 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : WC2A3NL 

(ii) TITLE OF INVENTION: Vertebrate Embryonic Pattern-Inducing 

Proteins and Uses Related Thereto 

(iii) NUMBER OF SEQUENCES: 47^ 

(iv) CC^OPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : 

(v) current application data: 
applic:ation number: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/176,427 

(B) FILING DATE: 3b-D£C-1993 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/356,060 

(B) FILING DATE: 14-DEC-1994 

(viii) ATT0RNEY/AC3ENT INFORMATION: 

(A) NAME: Vincent, Matthew P. 

(B) REGISTRATION NUli©BR: 36,709 

(C) REFERENCE/DOCKET NUMBER: HMI-006PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 

(D) TOPOLOGY: linear 
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433 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.1277 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG GTC GAA ATG CTG CTG TTG ACA AGA ATT CTC TTG GTG GGC TTC ATC 48 
Net Val Glu Met Leu Leu Leu Thr Arg He Leu Leu Val Gly Phe He 
1 5 10 15 

TGC GCT CTT TTA GTC TCC TCT GGG CTG ACT TGT GGA CCA GGC AGG GGC 96 
Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 

ATT GGA AAA AGG AGG CAC CCC AAA AAG CTG ACC CCG TTA GCC TAT AAG 144 
He Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 

CAG TTT ATT CCC AAT GTG GCA GAG AAG ACC CTA GGG GCC AGT GGA AGA 192 
Gin Phe He Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

TAT GAA GGG AAG ATC ACA AGA AAC TCC GAG AGA TTT AAA GAA CTA ACC 240 
Tyr Glu Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
65 70 75 eo 

CCA AAT TAC AAC CCT GAC ATT ATT TTT AAG GAT GAA GAG AAC ACG GGA 288 
Pro Asn Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thx Gly 
85 90 95 

GCT GAC AGA CTG ATG ACT CAG CGC TGC AAG GAC AAG CTG AAT GCC CTG 336 
Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 110 

GCG ATC TCG GTG ATG AAC CAG TGG CCC GGG GTG AAG CTG CGG GTG ACC 364 
Ala He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr 
115 120 125 

GAG GGC TGG GAC GAG GAT GGC CAT CAC TCC GAG GAA TCG CTG CAC TAC 432 
Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

GAG GGT CGC GCC GTG GAC ATC ACC ACG TCG GAT CGG GAC CGC AGC AAG 480 
Glu Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys 
145 150 155 160 

TAC GGA ATG CTG GCC CGC CTC GCC GTC GAG GCC GGC TTC GAC TGG GTC 528 
Tyr Gly Met X«eu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 

TAC TAC GAG TCC AAG GCG CAC ATC CAC TGC TCC GTC AAA GCA GAA AAC 576 
Tyr Tyr Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn 
180 185 190 
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TCA GTG GCA GCG AAA TCA GGA GGC TGC TTC CCT GGC TCA GCC ACA GTG • 624 
Ser Val Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

CAC CTG GAG CAT GGA GGC ACC AAG CTG GTG AAG GAC CTG AGC CCT GGG 672 
His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215 220 

GAC CGC GTG CTG GCT GOT GAC GCG GAC GGC CGG CTG CTC TAC AGT GAC 720 
Asp Arg Val Leu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
225 230 235 240 

TTC CTC ACC TTC CTC GAC CGG ATG GAC AGC TCC CGA AAG CTC TTC TAC 768 
Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
245 250 255 

GTC ATC GAG ACG CGG CAG CCC CGG GCC CGG CTG CTA CTG ACQ GCG GCC 816 
Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

CAC CTG CTC TTT GTG GCC CCC CAG CAC AAC CAG TCG GAG GCC ACA GGG 864 
His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

TCC ACC AGT GGC CAG GCG CTC TTC GCC AGC AAC GTG AAG CCT GGC CAA 912 
Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

CGT GTC TAT GTG CTG GGC GAG GGC GGG CAG CAG CTG CTG CCG GCG TCT 960 
Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
305 310 315 320 

GTC CAC AGC GTC TCA TTG CGG GAG GAG GCG TCC GGA GCC TAC GCC CCA 1008 
Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
325 330 335 

CTC ACC GCC CAG GGC ACC ATC CTC ATC AAC CGG GTG TTG GCC TCC TGC 1056 
Leu Thr Ala Gin Gly Thr He Leu He Asn Arg Val Leu Ala Ser Cys 
340 345 350 

TAC GCC GTC ATC GAG GAG CAC AGT TGG GCC CAT TGG GCC TTC GCA CCA 1104 
Tyr Ala Val He Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

TTC CGC TTG GCT CAG GGG CTG CTG GCC GCC CTC TGC CCA GAT GGG GCC 1152 
Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

ATC CCT ACT GCC GCC ACC ACC ACC ACT GGC ATC CAT TGG TAC TCA CGG 1200 
He Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
385 390 395 4OO 

CTC CTC TAC CGC ATC GGC AGC TGG GTG CTG GAT GGT GAC GCG CTG CAT 1248 
Leu Leu Tyr Arg He Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
405 410 415 
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CCG CTG GGC ATG GTG GCA CCG GCC AGC TG 1277* 
Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION; 1..1191 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATG GCT CTG CCG GCC AGT CTG TTG CCC CTG TGC TGC TTG GCA CTC TTG 48 
Met Ala Leu Pro Ala Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
1 . 5 10 15 

GCA CTA TCT GCC CAG AGC TGC GGG CCG GGC CGA GGA CCG GTT GGC CGG 96 
Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

CGG CGT TAT GTG CGC AAG CAA CTT GTG CCT CTG CTA TAC AAG CAG TTT 144 
Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

GTG CCC AGT ATG CCC GAG CGG ACC CTG GGC GCG AGT GGG CCA GCG GAG 192 
Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

GGG AGG GTA ACA AGG GGG TCG GAG CGC TTC CGG GAC CTC GTA CCC AAC 240 
Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATA ATC TTC AAG GAT GAG GAG AAC AGC GGC GCA GAC 288 
Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 
85 90 95 

CGC CTG ATG ACA GAG CGT TGC AAA GAG CGG GTG AAC GCT CTA GCC ATC 336 
Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala He 
100 105 110 

GCG GTG ATG AAC ATG TGG CCC GGA GTA CGC CTA CGT GTG ACT GAA GGC 384 
Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

TGG GAC GAG GAC GGC CAC CAC GCA CAG GAT TCA CTC CAC TAC GAA GGC 432 
Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 
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CGT GCC TTG GAC ATC ACC ACG TCT GAC CGT GAC CGT AAT AAG TAT GGT 480 
Arg Ala Leu Asp lie Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 ISO 155 160 

TTG TTG GCG CGC CTA GCT GTG GAA GCC GGA TTC GAC TGG GTC TAC TAC 528 
Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

GAG TCC CGC AAC CAC ATC CAC GTA TCG GTC AAA GCT GAT AAC TCA CTG 576 
Glu Ser Arg Asn His He His Val Ser Val Lys Ala Asp Asn Ser Leu 
180 185 190 

GCG GTC CGA GCC GGA GGC TGC TTT CCG GGA AAT GCC ACG GTG CGC TTG 624 
Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

CGG AGC GGC GAA CGG AAG GGG CTG AGG GAA CTA CAT CGT GGT GAC TGG 672 
Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

GTA CTG GCC GCT GAT GCA GCG GGC CGA GTG GTA CCC ACG CCA GTG CTG 720 
Val Leu Ala Ala Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

CTC TTC CTG GAC CGG GAT CTG CAG CGC CGC GCC TCG TTC GTG GCT GTG 768 
Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Alai Val 
245 250 255 

GAG ACC GAG CGG CCT CCG CGC AAA CTG TTG CTC ACA CCC TGG CAT CTG 816 
Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
260 265 270 

GTG TTC GCT GCT CGC GGG CCA GCG CCT GCT CCA GGT GAC TTT GCA CCG 864 
Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

GTG TTC GCG CGC CGC TTA CGT GCT GGC GAC TCG GTG CTG GCT CCC GGC 912 
Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

GGG GAC GCG CTC CAG CCG GCG CGC GTA GCC CGC GTG GCG CGC GAG GAA 960 
Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

GCC GTG GGC GTG TTC GCA CCG CTC ACT GCG CAC GGG ACG CTG CTG GTC 1008 
Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 
325 330 335 

AAC GAC GTC CTC GCC TCC TGC TAC GCG GTT CTA GAG AGT CAC CAG TGG 1056 
Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
340 345 350 

GCC CAC CGC GCC TTC GCC CCT TTG CGG CTG CTG CAC GCG CTC GGG GCT 1104 
Ala His Arg Ala Phe Ala Pro Leu Arg Leu Leu His Ala Leu Gly Ala 
355 360 365 
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CTG CTC CCT GGG GGT GCA GTC CAG CCG ACT GGC ATG CAT TGG TAC TCT 1152 
Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met His Trp Tyr Ser 
370 375 380 

CGC CTC CTT TAC CGC TTG QCC GAG GAG TTA ATG GGC TG 1190 
Arg Leu Leu Tyr Arg Leu Ala Glu Glu Leu Met Gly 
385 390 395 



(2) INFORMATION FOR SEQ ID N0:3; 

(i) SEQUENCE CHT^CTERISTICS : 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1056 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAG CGC TTC AAA GAG CTC ACC CCC AAC TAC AAT CCC GAC ATC ATC TTC 48 
Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe 
15 10 15 

AAG GAC GAG GAG AAC ACG GGT GCC GAC CGC CTC ATG ACC CAG CGC TGC 96 
Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys 
20 25 30 

AAG GAC CGT CTG AAC TCA CTG GCC ATC TCT GTC ATG AAC CAG TGG CCT 144 
Lys Asp Arg Leu Asn Ser Leu Ala lie Ser Val Met Asn Gin Trp Pro 
35 40 45 

GGT GTG AAA CTG CGG GTG ACC GAA GGC CGG GAT QAA GAT GGC CAT CAC 192 
Gly Val Lys Leu Arg Val Thr Glu Gly Arg Asp Glu Asp Gly His Ris 
50 55 60 

TCA GAG GAG TCT TTA CAC TAT GAG GGC CGC GCG GTG GAT ATC ACC ACC 240 
Ser Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr 
65 70 75 80 

TCA GAC CGT GAC CGA AAT AAG TAT GGA CTG CTG GCG CGC TTA GCA GTG 288 
Ser Asp Arg Asp Arg Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val 
85 90 95 

GAG GCC GGC TTC GAC TGG GTG TAT TAC GAG TCC AAG GCC CAC GTG CAT 336 
Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Lys Ala His Val His 
100 105 110 

TGC TCT GTC AAG TCT GAG CAT TC6 GCC GCT GCC AAG ACA GGT GGC TGC 384 
Cys Ser Val Lys Ser Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys 
115 120 125 
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TTT CCT GCC GGA GCC CAG GTG CGC CTA GAG AAC GGG GAG CGT GTG GCC 432 
Phe Pro Ala Gly Ala Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala 
130 135 140 

CTG TCA GCT GTA AAG CCA GGA GAC CGG GTG CTG GCC ATG GGG GAG GAT 480 
Leu Ser Ala Val Lys Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp 
145 150 155 160 

GGG ACC CCC ACC TTC AGT GAT GTG CTT ATT TTC CTG GAC CGC GAG CCA 528 
Gly Thr Pro Thr Phe Ser Asp Val Leu lie Phe Leu Asp Arg Glu Pro 
165 170 175 

AAC CGG CTG AGA GCT TTC CAG GTC ATC GAG ACT CAG GAT CCT CCG CGT 576 
Asn Arg Leu Arg Ala Phe Gin Val lie Glu Thr Gin Asp Pro Pro Arg 
leo 185 190 

CGG CTG GCG CTC ACG CCT GCC CAC CTG CTC TTC ATT GCG GAC AAT CAT 624 
Arg Leu Ala Leu Thr Pro Ala His Leu Leu Phe lie Ala Asp Asn His 
195 200 205 

ACA GAA CCA GCA GCC CAC TTC CGG GCC ACA TTT GCC AGC CAT GTG CAA 672 
Thr Glu Pro Ala Ala His Phe Arg Ala Thr Phe Ala Ser His Val Gin 
210 215 220 

CCA GGC CAA TAT GTG CTG GTA TCA GGG GTA CCA GGC CTC CAG CCT GCT 720 
Pro Gly Gin Tyr Val Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala 
225 230 235 240 

CGG GTG GCA GCT GTC TCC ACC CAC GTG GCC CTT GGG TCC TAT GCT CCT 768 
Arg Val Ala Ala Val Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro 
245 250 255 

CTC ACA AGG CAT GGG ACA CTT GTG GTG GAG GAT GTG GTG GCC TCC TGC 816 
Leu Thr Arg His Gly Thr Leu Val Val Glu Asp Val Val Ala Ser Cys 
260 265 270 

TTT GCA GCT GTG GCT GAC CAC CAT CTG GCT CAG TTG GCC TTC TGG CCC 864 
Phe Ala Ala Val Ala Asp His His Leu A^a Gin Leu Ala Phe Trp Pro 
275 280 285 

CTG CGA CTG TTT CCC AGT TTG GCA TGG GGC AGC TGG ACC CCA AGT GAG 912 
Leu Arg Leu Phe Pro Ser Leu Ala Trp Gly Ser Trp Thr Pro Ser Glu 
290 295 300 

GGT GTT CAC TCC TAC CCT CAG ATG CTC TAC CGC CTG GGG CGT CTC TTG 960 
Gly Val His Ser Tyr Pro Gin Met Leu Tyr Arg Leu Gly Arg Leu Leu 
305 310 315 320 

CTA GAA GAG AGC ACC TTC CAT CCA CTG GGC ATG TCT GGG GCA GGA AGC 1008 
Leu Glu Glu Ser Thr Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 
325 330 335 



TGAAGGGACT CTAACCACTG CCCTCCTGGA ACTGCTGTGC GTGGATCC 



1056 
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(2) ZNFORM^ITION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1314 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG CTG CTG CTG CTG GCC AGA TGT TTT CTG GTG ATC CTT, GCT TCC TCG 48 
Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val lie Leu Ala Ser Ser 
1 5 10 15 

CTG CTG GTG TGC CCC GGG CTG GCC TGT GGG CCC GGC AGG GGG TTT GGA 96 
Leu Leu Val Cys Pro Qly Leu Ala Cys Gly Pro Gly Arg Gly Phe Oly 
20 25 30 

AAG AGG CGG CAC CCC AAA AAG CTG ACC CCT TTA GCC TAC AAG CAG TTT 144 
Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 

ATT CCC AAC GTA GCC GAG AAG ACC CTA GGG GCC AQC GGC AGA TAT GAA 192 
lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

GGG AAG ATC ACA AGA AAC TCC GAA CGA TTT AAG GAA CTC ACC CCC AAT 240 
Gly Lys He Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn 
65 70 75 80 

TAC AAC CCC GAC ATC ATA TTT AAG GAT GAG GAA AAC ACG GGA GCA GAC 288 
Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 
85 90 95 

CGG CTG ATG ACT CAG AGG TGC AAA GAC AAG TTA AAT GCC TTG GCC ATC 336 
Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He 
100 105 110 

TCT GTG ATG AAC CAG TGG CCT GGA GTG AGG CTG CGA GTG ACC GAG GGC 384 
Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

TGG GAT GAG GAC GGC CAT CAT TCA GAG GAG TCT CTA CAC TAT GAG GGT 432 
Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

CGA GCA GTG GAC ATC ACC ACG TCC GAC CGG GAC C6C AGC AAG TAC GGC 480 
Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 
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ATG CTG GCT CGC CTG GOT GTG GAA GCA GGT TTC GAC TGG GTC TAC TAT 5^8 
Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

GAA TCC AAA GCT CAC ATC CAC TGT TCT GTG AAA GCA GAG AAC TCC GTG 576 
Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 165 190 

GC6 GCC AAA TCC GGC GGC TGT TTC CCG GGA TCC GCC ACC GTG CAC CTG €24 
Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 

GAG CAG GGC GGC ACC AAG CTG GTG AAG GAC TTA CGT CCC GGA GAC CGC 672 
Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

GTG CTG GCG GCT GAC GAC CAG GGC CGG CTG CTG TAC AGC GAC TTC CTC 720 
Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
225 230 235 240 

ACC TTC CTG GAC CGC GAC GAA GGC GCC AAG AAG GTC TTC TAC GTG ATC 768 
Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val lie 
245 250 255 

GAG ACG CTG GAG CCG CGC GAG CGC CTG CTG CTC ACC GCC GCG CAC CTG 816 
Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

CTC TTC GTG GCG CCG CAC J^AC GAC TCG GGQ CCC ACG CCC GGG CCA AGC 864 
Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

GCG CTC TTT GCC AGC CGC GTG CGC CCC GGG CAG CGC GTG TAC GTG GTG 912 
Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val 
290 295 300 

GCT GAA CGC GGC GGG GAC CGC CGG CTG CTG CCC GCC GCG GTG CAC AGC 960 
Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
305 310 315 320 

GTG ACG CTG CGA GAG GAG GAG GCG GGC GCG TAC GCG CCG CTC ACG GCG 1008 
Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 
325 330 335 

CAC GGC ACC ATT CTC ATC AAC CGG GTG CTC GCC TCG TGC TAC GCT GTC 1056 
His Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

ATC GAG GAG CAC AGC TGG GCA CAC CGG GCC TTC GCG CCT TTC CGC CTG 1104 
lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 

GCG CAC GCG CTG CTG GCC GCG CTG GCA CCC GCC CGC ACG GAC GGC GGG 1152 
Ala His Ala Zjeu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 
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GGC GGG GGC AGC ATC CCT GCA GCG CAA TCT GCA ACG GAA GCG AGG GGC 1200 
Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
385 390 395 400 



GCG GAG CCG ACT GCG GGC ATC CAC TGG TAC TCG CAG CTG CTC TAC CAC 1248 
Ala Glu Pro Thr Ala Gly lie His Trp Tyr Ser Gin Leu Leu Tyr His 
405 410 415 

ATT GGC ACC TGG CTG TTG GAC AGC GAG ACC ATG CAT CCC TTG GGA ATG 1296 
lie Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 

GCG GTC AAG TCC AGC TG 1313 
Ala Val Lys Ser Ser 
435 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

. (A) NAME/KEY: CDS 
(B) LOCATION: l.,1257 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATG CGG CTT TTG ACG AGA GTG CTG CTG GTG TCT CTT CTC ACT CTG TCC 48 
Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
1.5 10 15 

TTG GTG GTG TCC GGA CTG GCC TGC G6T CCT GGC AGA GGC TAC GGC AGA 96 
Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
20 .25 30 



AGA AGA CAT CCG AAG AAG CTG ACA CCT CTC GCC TAC AAG CAG TTC ATA 
Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 



144 



CCT AAT GTC GCG GAG AAG ACC TTA GGG GCC AGC GGC AGA TAC GAG GGC 
Pro Asn Val Ala Qlu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Qlu Gly 
50 55 60 



192 



AAG ATA ACG CGC AAT TCG GAG AGA TTT AAA GAA CTT ACT CCA AAT TAC 
Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
€5 70 75 80 



240 



AAT CCC GAC ATT ATC TTT AAG GAT GAG GAG AAC ACG GGA GCG GAC AGG 
Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 



288 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



/V2 

CTC ATG ACA CAG AGA TGC AAA GAC AAG CTG AAC TCG CTG GCC ATC TCT 336 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala He Ser 
100 105 110 

GTA ATG AAC CAC TGG CCA GGG GTT AAG CTG CGT GTG ACA GAG 6GC TGG 384 
Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

GAT GAG GAC GGT CAC CAT TTT GAA GAA TCA CTC CAC TAC GAG 6GA AGA 432 
Asp Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

GCT GTT GAT ATT ACC ACC TCT GAC CGA GAC AAG AGC AAA TAC GGG ACA 480 
Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
W5 150 155 160 

CTG TCT CGC CTA GCT GTG GAG GCT GGA TTT GAC TGG GTC TAT TAC GAG 528 
Leu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

TCC AAA GCC CAC ATT CAT TGC TCT GTC AAA GCA GAA AAT TCG GTT GCT 576 
Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCG AAA TCT GGG GGC TGT TTC CCA GGT TCG GCT CTG GTC TCG CTC CAG 624 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 

GAC GGA GGA CAG AAG GCC GTG AAG GAC CTG AAC CCC GGA GAC AAG GTG 672 
Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

CTG GCG GCA GAC AGC GCG GGA AAC CTG GTG TTC AGC GAC TTC ATC ATG 720 
Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe He Met 
225 230 235 240 

TTC ACA GAC CGA GAC TCC ACG ACG CGA CGT GTG TTT TAC GTC ATA GAA 768 
Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val He Glu 
245 250 255 

ACG CAA GAA CCC GTT GAA AAG ATC ACC CTC ACC GCC GCT CAC CTC CTT 816 
Thr Gin Glu Pro Val Glu Lys lie Thr Leu Thr Ala Ala His Leu Leu 
260 265 270 

TTT GTC CTC GAC AAC TCA ACG GAA GAT CTC CAC ACC ATG ACC GCC GCG 864 
Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 285 

TAT GCC AGC AGT GTC AGA GCC GGA CAA AAG GTG ATG GTT GTT GAT GAT 912 
Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 

AGC GGT CAG CTT AAA TCT GTC ATC GTG CAG CGG ATA TAC ACG GAG GAG 960 
Ser Gly Gin Leu Lys Ser Val He Val Gin Arg He Tyr Thr Glu Glu 
305 310 315 320 
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CAG CGG GGC TCG TTC GCA CCA GTG ACT GCA CAT GGG ACC ATT GTG GTC 1008 
Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr lie Val Val 
325 330 335 

GAC AGA ATA CTG GCG TCC TGT TAG GCC GTA ATA GAG GAC CAG GGG CTT 1056 
Asp Arg lie Leu Ala Ser Cys Tyr Ala Val lie Glu Asp Gin Gly Leu 
340 345 350 

GCG CAT TTG GCC TTC GCG CCC GCC AGG CTC TAT TAT TAC GTG TCA TCA 1104 
Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

TTC CTG TCC CCC AAA ACT CCA GCA GTC GGT CCA ATG CGA CTT TAC AAC 1152 
Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

AGG AGG GGG TCC ACT GGT ACT CCA GGC TCC TGT CAT CAA ATG GGA ACG 1200 
Arg Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

TGG CTT TTG GAC AGC AAC ATG CTT CAT CCT TTG GGG ATG TCA GTA AAC 1248 
Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 
405 410 415 

TCA AGC TG 1256 
Ser Ser 



(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1425 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATG CTG CTG CTG GCG AGA TGT CTG CTG CTA GTC CTC GTC TCC TCG CTG 48 
Met Leu Leu Leu Ala Arg Cys Leu Leu Leu Val Leu Val Ser Ser Leu 
1 5 10 15 

CTG GTA TGC TCG GGA CTG GCG TGC GGA CC6 GGC AGG GGG TTC GGG AAG 96 
Leu Val Cys Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly Lys 
20 25 30 

AGG AGG CAC CCC AAA AAG CTG ACC CCT TTA GCC TAC AAG CAG TTT ATC 144 
Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 
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CCC AAT GTG GCC GAG AAG ACC CTA GGC 6CC AGC 66A AGG TAT GAA GGG 192 
Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

AAG ATC TCC AGA AAC TCC GAG CGA TTT AAG GAA CTC ACC CCC AAT TAC 240 
Lys lie Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 



AAC CCC GAC ATC ATA TTT AAG GAT GAA GAA AAC ACC GGA GCG GAC AGG 
Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
.85 90 95 



288 



CTG ATG ACT CAG AGG TGT AAG GAC AAG TTG AAC GCT TTG GCC ATC TCG 
Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala lie Ser 
100 105 110 



336 



GTG ATG AAC CAG TGG CCA GGA GTG AAA CTG CGG GTG ACC GAG GGC TGG 
Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 



384 



GAC GAA GAT GGC CAC CAC TCA GAG GAG TCT CTG CAC TAC GAG GGC CGC 
Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 



432 



GCA GTG GAC ATC ACC ACG TCT GAC CGC GAC CGC AGC AAG TAC GGC ATG 480 
Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met 
145 150 155 160 

CTG GCC CGC CTG GCG GTG GAG GCC GGC TTC GAC TGG GTG TAC TAC GAG 528 
Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

TCC AAG GCA CAT ATC CAC TGC TCG GTG AAA GCA GAG AAC TCG GTG GCG 576 
ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

GCC AAA TCG GGA GGC TGC TTC CCG GGC TCG GCC ACG GTG CAC CTG GAG 624 
Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 



CAG GGC GGC ACC AAG CTG GTG AAG GAC CTG AGC CCC GGG GAC CGC GTG 
Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
210 215 220 



€72 



CTG GCG GCG GAC GAC CAG GGC CGG CTG CTC TAC AGC GAC TTC CTC ACT 
Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
225 230 235 240 



720 



TTC CTG GAC CGC GAC GAC GGC GCC AAG AAG GTC TTC TAC GTG ATC GAG 
Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val He Glu 
245 250 255 



766 



ACG CGG GAG CCG CGC GAG CGC CTG CTG CTC ACC GCC GCG CAC CTG CTC 
Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 270 



816 
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TTT GTG GCG CCG CAC AAC GAC TCG GCC ACC GGG GAG CCC GAG GCG TCC 864 
Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
275 280 265 

5 TCG GGC TCG GGG CCG CCT TCC GGG GGC GCA CTG GGG CCT CGG GCG CTG 912 
Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala Leu 
290 295 300 

TTC GCC AGC CGC GTG CGC CCG GGC CAG CGC GTG TAC GTG GTG GCC GAG 960 
10 Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 310 315 320 

CGT GAC GGG GAC CGC CGG CTC CTG CCC GCC GCT GTG CAC AGC GTG ACC 1008 
Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 
15 325 330 335 

CTA AGC GAG GAG GCC GCG GGC GCC TAC GCG CCG CTC ACG GCC CAG GGC 1056 

Leu Ser Glu Glu Ala Ala Gly Ala Tyr Ala Pro Leu Thr Ala Gin Gly 
340 345 350 

20 

ACC ATT CTC ATC AAC CGG GTG CTG GCC TCG TGC TAC GCG GTC ATC GAG 1104 

Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val lie Glu 
355 360 365 

25 GAG CAC AGC TGG GCG CAC CGG GCC TTC GCG CCC TTC CGC CTG GCG CAC 1152 
Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
370 375 380 

GCG CTC CTG GCT GCA CTG GCG CCC GCG CGC ACG GAC CGC GGC GGG GAC 1200 
30 Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

AGC GGC GGC GGG GAC CGC GGG GGC GGC GGC GGC AGA GTA GCC CTA ACC 1248 
Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 
35 405 410 415 

GCT CCA GGT GCT GCC GAC GCT CCG GGT GCG GGG GCC ACC GCG GGC ATC 1296 
Ala Pro Gly. Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly lie 
420 425 430 

40 

CAC TGG TAC TCG CAG CTG CTC TAC CAA ATA GGC ACC TGG CTC CTG GAC 1344 
His Trp Tyr Ser Gin Leu Leu Tyr Gin lie Gly Thr Trp Leu Leu Asp 
435 440 445 

45 AGC GAG GCC CTG CAC CCG CTG GGC ATG GCG GTC AAG TCC AGC NNN AGC 1392 
Ser Glu Ala Leu His Pro Leu Gly Net Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

CGG GGG GCC GGG GGA GGG GCG CGG GAG GGG GCC 1425 
50 Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 



55 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 940 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAKDEDHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cZSNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..940 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGG CGC CTC ATG ACC CAG CGC TGC AAG GAC CGC CTG AAC TCG CTG GCT 48 
Arg Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn Ser Leu Ala 
15 10 15 

ATC TCG GTG ATG AAC CAG TGG CCC GGT GTG AAG CTG CGG GTG ACC GAG 96 
lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu 
20 25 30 

GGC TGG GAC GAG GAC GGC CAC CAC TCA GAG GAG TCC CTG CAT TAT GAG 144 
Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu 
35 40 45 

GGC CGC GCG GTG GAC ATC ACC ACA TCA GAC CGC GAC CGC AAT AAG TAT 192 
Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr 
50 55 60 

GGA CTG CTG GCG CGC TTG GCA GTG GAG GCC GGC TTT GAC TGG GTG TAT 240 
Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr 
65 70 75 80 

TAG GAG TCA AAG GCC CAC GTG CAT TGC TCC GTC AAG TCC GAG CAC TCG 288 
Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser Glu His Ser 
85 90 95 

GCC GCA GCC AAG ACG GGC GGC TGC TTC CCT GCC GGA GCC CAG GTA CGC 336 
Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin Val Arg 
100 105 110 

CTG GAG AGT GGG GCG CGT GTG GCC TTG TCA GCC GTG AGG CCG GGA GAC 384 
Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val Arg Pro Gly Asp 
115 120 125 

CGT GTG CTG GCC ATG GGG GAG GAT GGG AGC CCC ACC TTC AGC GAT GTG 432 
Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe Ser Asp Val 
130 135 140 

CTC ATT TTC CTG GAC CGC GAG CCC CAC AGG CTG AGA GCC TTC CAG GTC 480 
Leu He Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala Phe Gin Val 
145 150 155 160 

ATC GAG ACT CAG GAC CCC CCA CGC CGC CTG GCA CTC ACA CCC GCT CAC 528 
He Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr Pro Ala His 
165 170 175 
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CTG CTC TTT ACG GCT GAC AAT CAC ACG GAG CCG GCA GCC CGC TTC CGG • 576 

Leu Leu Phe Thr Ala Asp Asn His Thr Glu Pro Ala Ala Arg Phe Arg 
180 185 190 

5 GCC ACA TTT GCC AGC CAC GTG CAG CCT GGC CAG TAC GTG CTG GTG GCT 624 
Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val Leu Val Ala 
195 200 205 

GGG GTG CCA GGC CTG CAG CCT GCC CGC GTG GCA GCT GTC TCT ACA CAC 672 
lO'^ Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val Ser Thr His 
210 215 220 

GTG GCC CTC GGG GCC TAC GCC CCG CTC ACA AAG CAT GGG ACA CTG GTG 720 
Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly Thr Leu Val 
15 225 230 235 240 

GTG GAG GAT GTG GTG GCA TCC TGC TTC 6CG GCC GTG GCT GAC CAC CAC 768 
Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala Asp His His 
245 250 255 

20 

CTG GCT CAG TTG GCC TTC TGG CCC CTG AGA CTC TTT CAC AGC TTG GCA 816 
Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His Ser Leu Ala 
260 265 270 

25 TGG GGC AGC TGG ACC CCG GGG GAG GGT GTG CAT TGG TAC CCC CAG CTG 864 
Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His. Tzp Tyr Pro Gin Leu 
275 280 285 

CTC TAC CGC CTG GGG CGT CTC CTG CTA GAA GAG GGC AGC TTC CAC CCA 912 
30 Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser Phe His Pro 
290 295 300 

CTG GGC ATG TCC GGG GCA GGG AGC TGA 940 
Leu Gly Met Ser Gly Ala Gly Ser Xaa 
35 305 310 



(2) INFORMATION FOR SEQ ID NO: 8: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Val Glu Met Leu Leu Leu Thr Arg lie Leu Leu Val Gly Phe He 
50 1 5 10 15 

Cys Ala Leu Leu Val Ser Ser Gly Leu Thr Cys Gly Pro Gly Arg Gly 
20 25 30 



55 



He Gly Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys 
35 40 45 
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Gin Phe lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg 
50 55 60 

Tyr Glu Gly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr 
5 65 70 75 80 

Pro Asn Tyr Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly 
85 90 95 

10 Ala Asp Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu 
100 105 XIO 

Ala lie Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr . 
115 120 125 

15 

Glu Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr 
130 135 140 

Glu Gly Arg Ala Val Asp lie Thr Thr Ser Asp Arg Asp Arg Ser Lys 
20 145 150 155 160 

Tyr Gly Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val 
165 170 175 



25 Tyr Tyr Glu Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn 
180 185 190 

Ser Va:i Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val 
195 200 205 

30 

His Leu Glu His Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly 
210 215 220 

Asp Arg Val I^eu Ala Ala Asp Ala Asp Gly Arg Leu Leu Tyr Ser Asp 
35 225 230 235 240 

Phe Leu Thr Phe Leu Asp Arg Met Asp Ser Ser Arg Lys Leu Phe Tyr 
245 250 255 

40 Val lie Glu Thr Arg Gin Pro Arg Ala Arg Leu Leu Leu Thr Ala Ala 
260 265 270 

His Leu Leu Phe Val Ala Pro Gin His Asn Gin Ser Glu Ala Thr Gly 
275 280 285 

45 

Ser Thr Ser Gly Gin Ala Leu Phe Ala Ser Asn Val Lys Pro Gly Gin 
290 295 300 

Arg Val Tyr Val Leu Gly Glu Gly Gly Gin Gin Leu Leu Pro Ala Ser 
50 305 310 315 320 

Val His Ser Val Ser Leu Arg Glu Glu Ala Ser Gly Ala Tyr Ala Pro 
325 330 335 

55 Leu Thr Ala Gin Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys 
340 345 350 
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Tyr Ala Val He Glu Glu His Ser Trp Ala His Trp Ala Phe Ala Pro 
355 360 365 

Phe Arg Leu Ala Gin Gly Leu Leu Ala Ala Leu Cys Pro Asp Gly Ala 
370 375 380 

He Pro Thr Ala Ala Thr Thr Thr Thr Gly He His Trp Tyr Ser Arg 
385 390 395 400 

Leu Leu Tyr Arg He Gly Ser Trp Val Leu Asp Gly Asp Ala Leu His 
405 410 415 

Pro Leu Gly Met Val Ala Pro Ala Ser 
420 425 



(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Ala Leu Pro Ala Ser Leu Leu Pro Leu Cys Cys Leu Ala Leu Leu 
1 5 10 15 

Ala Leu Ser Ala Gin Ser Cys Gly Pro Gly Arg Gly Pro Val Gly Arg 
20 25 30 

Arg Arg Tyr Val Arg Lys Gin Leu Val Pro Leu Leu Tyr Lys Gin Phe 
35 40 45 

Val Pro Ser Met Pro Glu Arg Thr Leu Gly Ala Ser Gly Pro Ala Glu 
50 55 60 

Gly Arg Val Thr Arg Gly Ser Glu Arg Phe Arg Asp Leu Val Pro Asn 
65 70 75 80 

Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Ser Gly Ala Asp 
65 90 95 

Arg Leu Met Thr Glu Arg Cys Lys Glu Arg Val Asn Ala Leu Ala He 
100 105 110 

Ala Val Met Asn Met Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ala Gin Asp Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Leu Asp He Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Gly 
145 150 155 160 
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Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
165 170 175 

Glu Ser Arg Asn His lie Bis Val Ser Val Lys Ala Asp Asn Ser Leu 
5 180 185 190 

Ala Val Arg Ala Gly Gly Cys Phe Pro Gly Asn Ala Thr Val Arg Leu 
195 200 205 

10 Arg Ser Gly Glu Arg Lys Gly Leu Arg Glu Leu His Arg Gly Asp Trp 
210 215 220 

Val Leu Ala Ala Asp Ala Ala Gly Arg Val Val Pro Thr Pro Val Leu 
225 230 235 240 

Leu Phe Leu Asp Arg Asp Leu Gin Arg Arg Ala Ser Phe Val Ala Val 
245 250 255 

Glu Thr Glu Arg Pro Pro Arg Lys Leu Leu Leu Thr Pro Trp His Leu 
20 260 265 270 

Val Phe Ala Ala Arg Gly Pro Ala Pro Ala Pro Gly Asp Phe Ala Pro 
275 280 285 

25 Val Phe Ala Arg Arg Leu Arg Ala Gly Asp Ser Val Leu Ala Pro Gly 
290 295 300 

Gly Asp Ala Leu Gin Pro Ala Arg Val Ala Arg Val Ala Arg Glu Glu 
305 310 315 320 

30 

Ala Val Gly Val Phe Ala Pro Leu Thr Ala His Gly Thr Leu Leu Val 
325 330 335 

Asn Asp Val Leu Ala Ser Cys Tyr Ala Val Leu Glu Ser His Gin Trp 
35 340 345 350 

Ala His Arg Ala Phe Ala Pro Leu Arg lieu Leu His Ala Leu Gly Ala 
355 360 365 

40 Leu Leu Pro Gly Gly Ala Val Gin Pro Thr Gly Met Bis Trp Tyr Ser 
370 375 380 

Arg Zieu Leu Tyr Arg Leu Ala Glu Glu Leu Met Gly 
385 390 395 

45 

(2) Il^RMATION FOR S£Q ID N0:10: 

(i) SEQUENCE CHAJyVCTERISTICS : 
50 (A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 

(xi) SEQUENCE DESCRIPTIC^: SEQ ID NO: 10: 
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Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe 
15 10 15 

Iiys Asp Glu Glu Asn Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys 
20 25 30 

Lys Asp Arg Leu Asn Ser Leu Ala He Ser Val Met Asn Gin Trp Pro 
35 40 45 

Gly Val Lys Leu Arg Val Thr Qlu Gly Arg Asp Glu Asp Gly His His 
50 5S SO 

Ser Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp He Thr Thr 
65 70 75 80 

Ser Asp Arg Asp Arg Asn Lys Tyr Gly Leu Leu Ala Arg Leu Ala Val 
85 90 95 

Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Lys Ala His Val His 
100 105 110 

Cys Ser Val Lys Ser Glu His Ser Ala Ala Ala Lys Thr Gly Gly Cys 
lis 120 125 

Phe Pro Ala Gly Ala Gin Val Arg Leu Glu Asn Gly Glu Arg Val Ala 
130 135 140 

Leu Ser Ala Val Lys Pro Gly Asp Arg Val Leu Ala Met Gly Glu Asp 
145 150 155 160 

Gly Thr Pro Thr Phe Ser Asp Val Leu He Phe Leu Asp Arg Glu Pro 
165 170 175 

Asn Arg Leu Arg Ala Phe Gin Val He Glu Thr Gin Asp Pro Pro Arg 
180 185 190 

Arg Leu Ala Leu Thr Pro Ala His Leu Leu Phe He Ala Asp Asn His 
195 200 205 

Thr Glu Pro Ala Ala His . Phe Arg Ala Thr Phe Ala Ser His Val Gin 
210 215 220 

Pro Gly Gin Tyr Val Leu Val Ser Gly Val Pro Gly Leu Gin Pro Ala 
225 230 235 240 

Arg Val Ala Ala Val Ser Thr His Val Ala Leu Gly Ser Tyr Ala Pro 
245 250 255 

Leu Thr Arg His Gly Thr Leu Val Val Glu Asp Val Val Ala Ser Cys 
260 265 270 



Phe Ala Ala Val Ala Asp His His Leu Ala Gin Leu Ala Phe Trp Pro 
275 280 285 



Leu Arg Leu Phe Pro Ser Leu Ala Trp Gly Ser Trp Thr Pro Ser Glu 
290 295 300 
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Gly Val His Ser Tyr Pro Gin Met Leu Tyr Arg Leu Gly Arg Leu Leu 
305 310 315 320 



Leu Glu Glu Ser Thr Phe His Pro Leu Gly Met Ser Gly Ala Gly Ser 
325 330 335 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Leu Leu Leu Leu Ala Arg Cys Phe Leu Val lie Leu Ala Ser Ser 
1 5 10 15 

Leu Leu Val Cys Pro Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly 
20 25 30 

Lys Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe 
35 40 45 . 

lie Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu 
50 55 60 

Qly Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Ash 
65 70 75 eo 

Tyr Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp 
85 90 95 

Arg Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He 
100 105 110 

Ser Val Met Asn Gin Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly 
115 120 125 

Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly 
130 135 140 

Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly 
145 150 155 160 

Met Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr 
1'5 170 175 

Glu Ser Lys Ala His He His Cys Ser Val Lys Ala Glu Asn Ser Val 
180 185 190 

Ala Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu 
195 200 205 
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Glu Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Arg Pro Gly Asp Arg 
210 215 220 

Val Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu 
5 225 230 235 240 

Thr Phe Leu Asp Arg Asp Glu Gly Ala Lys Lys Val Phe Tyr Val lie 
245 250 255 

10 Glu Thr Leu Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu 
260 265 270 

Leu Phe Val Ala Pro His Asn Asp Ser Gly Pro Thr Pro Gly Pro Ser 
275 280 285 

15 

Ala Leu Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val . 
290 295 300 

Ala Glu Arg Gly Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser 
20 305 310 315 320 

Val Thr Leu Arg Glu Glu Glu Ala Gly Ala Tyr Ala Pro Leu Thr Ala 
325 330 335 

25 His Gly Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val 
340 345 350 

lie Glu Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu 
355 360 365 

30 

Ala His Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Gly Gly 
370 375 380 

Gly Gly Gly Ser lie Pro Ala Ala Gin Ser Ala Thr Glu Ala Arg Gly 
35 385 390 395 400 

Ala Glu Pro Thr Ala Gly He His Trp Tyr Ser Gin Leu Leu Tyr His 
405 410 415 

40 He Gly Thr Trp Leu Leu Asp Ser Glu Thr Met His Pro Leu Gly Met 
420 425 430 

Ala Val Lys Ser Ser 
435 

45 

(2) INFORMATIOK FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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Met Arg Leu Leu Thr Arg Val Leu Leu Val Ser Leu Leu Thr Leu Ser 
1 5 10 15 

Leu Val Val Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Tyr Gly Arg 
5 20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe lie 
35 40 45 

10 Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys lie Thr Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

15 

Asn Pro Asp lie lie Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 

Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala lie Ser 
20 100 105 110 

Val Met Asn His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 X25 

25 Asp Glu Asp Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg 
130 135 140 

Ala Val Asp He Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr 
145 150 155 160 

30 

Ijeu Ser Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 
165 170 175 

Ser Lys Ala Kis He His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
35 180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Leu Val Ser Leu Gin 
195 200 205 

40 Asp Gly Gly Gin Lys Ala Val Lys Asp Leu Asn Pro Gly Asp Lys Val 
210 215 220 

Leu Ala Ala Asp Ser Ala Gly Asn Leu Val Phe Ser Asp Phe He Met 
225 230 235 240 

45 

Phe Thr Asp Arg Asp Ser Thr Thr Arg Arg Val Phe Tyr Val He Glu 
245 250 255 

Thr Gin Glu Pro Val Glu Lys He Thr }ueu Thr Ala Ala His Leu Leu 
50 260 265 270 

Phe Val Leu Asp Asn Ser Thr Glu Asp Leu His Thr Met Thr Ala Ala 
275 280 265 

55 Tyr Ala Ser Ser Val Arg Ala Gly Gin Lys Val Met Val Val Asp Asp 
290 295 300 
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Ser Gly Gin Leu Lys Ser Val He Val Gin Arg He Tyr Thr Glu Glu 
305 310 315 320 

Gin Arg Gly Ser Phe Ala Pro Val Thr Ala His Gly Thr He Val Val 
325 330 335 

Asp Arg He Leu Ala Ser Cys Tyr Ala Val He Glu Asp Gin Gly Leu 
340 345 350 

Ala His Leu Ala Phe Ala Pro Ala Arg Leu Tyr Tyr Tyr Val Ser Ser 
355 360 365 

Phe Leu Ser Pro Lys Thr Pro Ala Val Gly Pro Met Arg Leu Tyr Asn 
370 375 380 

Arg Arg Gly Ser Thr Gly Thr Pro Gly Ser Cys His Gin Met Gly Thr 
385 390 395 400 

Trp Leu Leu Asp Ser Asn Met Leu His Pro Leu Gly Met Ser Val Asn 
405 410 415 

Ser Ser 



(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQX3ENCE CHARACTERISTICS: 

(A) LENGTH: 475 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Leu Leu Ala Arg Cys Leu Leu Xjeu Val Leu Val Ser Ser Leu 
1 5 10 15 

Leu Val Cys Ser Gly Leu Ala Cys Gly Pro Gly Arg Gly Phe Gly Lys 
20 25 30 

Arg Arg His Pro Lys Lys Leu Thr Pro Leu Ala Tyr Lys Gin Phe He 
35 . 40 45 

Pro Asn Val Ala Glu Lys Thr Leu Gly Ala Ser Gly Arg Tyr Glu Gly 
50 55 60 

Lys He Ser Arg Asn Ser Glu Arg Phe Lys Glu Leu Thr Pro Asn Tyr 
65 70 75 80 

Asn Pro Asp He He Phe Lys Asp Glu Glu Asn Thr Gly Ala Asp Arg 
85 90 95 

Leu Met Thr Gin Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He Ser 
100 105 110 
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Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp 
115 120 125 

Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg 
5 130 135 140 

Ala Val Asp lie Thr Thr Ser Asp Arg Asp Airg Ser Lys Tyr Gly Met 
145 150 155 160 

10 Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu 

165 170 175 



15 



30 



Ser Lys Ala His lie His Cys Ser Val Lys Ala Glu Asn Ser Val Ala 
180 185 190 

Ala Lys Ser Gly Gly Cys Phe Pro Gly Ser Ala Thr Val His Leu Glu 
195 200 205 



Gin Gly Gly Thr Lys Leu Val Lys Asp Leu Ser Pro Gly Asp Arg Val 
20 210 215 220 

Leu Ala Ala Asp Asp Gin Gly Arg Leu Leu Tyr Ser Asp Phe Leu Thr 
225 230 235 240 

25 Phe Leu Asp Arg Asp Asp Gly Ala Lys Lys Val Phe Tyr Val lie Glu 

245 250 255 



Thr Arg Glu Pro Arg Glu Arg Leu Leu Leu Thr Ala Ala His Leu Leu 
260 265 270 

Phe Val Ala Pro His Asn Asp Ser Ala Thr Gly Glu Pro Glu Ala Ser 
275 280 285 



Ser Gly Ser Gly Pro Pro Ser Gly Gly Ala Leu Gly Pro Arg Ala Leu 
35 290 295 300 

Phe Ala Ser Arg Val Arg Pro Gly Gin Arg Val Tyr Val Val Ala Glu 
305 310 315 320 

40 Arg Asp Gly Asp Arg Arg Leu Leu Pro Ala Ala Val His Ser Val Thr 

325 330 335 

Leu Ser Glu Glu Ala Ala Gly Ala Tyr Ala Pro Leu Thr Ala Gin Gly 
340 345 350 

45 

Thr lie Leu lie Asn Arg Val Leu Ala Ser Cys Tyr Ala Val lie Glu 
355 360 365 

Glu His Ser Trp Ala His Arg Ala Phe Ala Pro Phe Arg Leu Ala His 
50 370 375 380 

Ala Leu Leu Ala Ala Leu Ala Pro Ala Arg Thr Asp Arg Gly Gly Asp 
385 390 395 400 

55 Ser Gly Gly Gly Asp Arg Gly Gly Gly Gly Gly Arg Val Ala Leu Thr 

405 410 415 
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Ala Pro Gly Ala Ala Asp Ala Pro Gly Ala Gly Ala Thr Ala Gly lie 
420 425 430 

His Trp Tyr Ser Gin Leu Leu Tyr Gin lie Gly Thr Trp Leu Leu Asp 
5 435 440 445 

Ser Glu Ala Leu His Pro Leu Gly Met Ala Val Lys Ser Ser Xaa Ser 
450 455 460 

10 Arg Gly Ala Gly Gly Gly Ala Arg Glu Gly Ala 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 14: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

25 Arg Arg Leu Met Thr Gin Arg Cys Lys Asp Arg Leu Asn Ser Leu Ala 
1-5 10 15 

He Ser Val Met Asn Gin Trp Pro Gly Val Lys Leu Arg Val Thr Glu 
20 25 30 

30 

Gly Trp Asp Glu Asp Gly His His Ser Glu Glu Ser Leu His Tyr Glu 
35 40 45 

Gly Arg Ala Val Asp He Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr 
35 50 55 60 

Gly Leu Leu Ala Arg Leu Ala Val Glu Ala Gly Phe Asp Trp Val Tyr 
65 70 75 80 

40 Tyr Glu Ser Lys Ala His Val His Cys Ser Val Lys Ser Glu His Ser 

85 90 95 

Ala Ala Ala Lys Thr Gly Gly Cys Phe Pro Ala Gly Ala Gin Val Arg 
100 105 110 

45 

Leu Glu Ser Gly Ala Arg Val Ala Leu Ser Ala Val Arg Pro Gly Asp 
115 120 125 

Arg Val Leu Ala Met Gly Glu Asp Gly Ser Pro Thr Phe Ser Asp Val 
50 130 135 140 

Leu He Phe Leu Asp Arg Glu Pro His Arg Leu Arg Ala Phe Gin Val 
145 150 155 160 

55 He Glu Thr Gin Asp Pro Pro Arg Arg Leu Ala Leu Thr Pro Ala His 

165 170 175 
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lieu Leu Phe Thr Ala Asp Asn His Thr Glu Pro Ala Ala Arg Phe Arg 
180 185 190 

Ala Thr Phe Ala Ser His Val Gin Pro Gly Gin Tyr Val Leu Val Ala 
195 200 205 

Gly Val Pro Gly Leu Gin Pro Ala Arg Val Ala Ala Val Ser Thr His 
210 215 220 

Val Ala Leu Gly Ala Tyr Ala Pro Leu Thr Lys His Gly Thr Leu Val 
225 230 235 240 

Val Glu Asp Val Val Ala Ser Cys Phe Ala Ala Val Ala Asp His His 
245 250 255 

Leu Ala Gin Leu Ala Phe Trp Pro Leu Arg Leu Phe His Ser Leu Ala 
260 265 270 

Trp Gly Ser Trp Thr Pro Gly Glu Gly Val His Trp Tyr Pro Gin Leu 
275 280 285 

Leu Tyr Arg Leu Gly Arg Leu Leu Leu Glu Glu Gly Ser Phe His Pro 
290 295 300 

Leu Gly Met Ser Gly Ala Gly Ser Xaa 
305 . 310. 



(2) INFORMATION FOR SEQ ID N0:15: 

(i) SEQDBNCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TyPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAO'IENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala lie Ser Val Met Asn 
1 5 10 15 

His Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp 
20 25 30 

Gly His His Phe Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp 
35 40 45 

lie Thr Thr Ser Asp Arg Asp Lys Ser Lys Tyr Gly Thr Leu Ser Arg 
50 55 60 
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(2) INFOSMJITION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAC^IENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gin Arg Cys Lys Glu Lys Leu Asn Ser Leu Ala lie Ser Val Met Asn 
1 5 10 15 

Met Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp 
20 25 30 

Gly Asn His Phe Glu Asp Ser Leu His Tyr Glu Gly Arg Ala Val Asp 
35 40 45 

lie Thr Thr Ser Ser Asp Arg Asp Arg Asn Lys Tyr Gly Met Phe Ala 
50 55 60 

Arg 
65 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAOIENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Arg Cys Lys Asp Lys Leu Asn Ser Leu Ala lie Ser Val Met Asn 
15 10 15 

Leu Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp 
20 25 30 

Gly Leu His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp 
35 40 45 

He Thr Thr Ser Asp Arg Asp Arg Asn Lys Tyr Arg Met Leu Ala Arg 
50 55 60 
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10 



25 



45 



50 



(2) INFOPM^TION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
15 GGAATTCCCA 6CAGNT6CTA AAGGAAGCAA GNGCTMAA '38 

(2) INFORMATION FOR SEQ ID NO: 19: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



30 (3ci) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TCATCGATGG ACCCA6ATCQ AAANCCN6CT CTC 33 

35 (2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
{B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCTCTAGAGC TCNACNQCNA (3ANCGTNGC 29 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

A6CTGTCGAC GC6GCC6CTA C6TAG6TTAC CGAC6TCAAG CTTAGATCTC 50 

10 (2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
IS (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: CZ3NA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 22: 
AGCTGAGATC TAAGCTT(3AC GTCGGTAACC TACGTA6CGG CCGCXITOSAC 50 



(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: cDNA 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GATCGGCCAG GCAGGCCTCG CXSATATCGTC ACCGCGGTAT TCGAA 45 

(2) INFORMATION FOR SEQ ID N0:24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 JD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: dDNA 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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AGT6CCA6TC 666GCCCCCA 6G6CCGCGCC 
(2) INFORMATXQH FOR SEQ ZD NO: 25: 

(i) SKQX7ENCE CKARACTERXSTICS : 

(A) LENGTH: 20 base pairs 

(B) TVPE: nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TACCACAGCG GAT6GTTCG6 



(2) INFORMATION FOR SEQ ID N0:2£: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: dDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTGGTGGTTA TGCCGATCOC 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TAAGAGGCCT ATAAGAGGCG G 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 



21 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cZm 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AA6TCAGCCC AGAGGA6ACT 20 



(2) INFORMA.TIQN FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) T0P0X/)6Y: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTIW: SEQ ID NO: 29: 

Cys Gly Pro Gly Arg Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 30: 
AGCAGNTGCT AAAGGAAGCA AGNGCTMAA 



(2) INFORMATION FOR SEQ ID N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(XI) SEQX7ENCE DESCRIPTION: SEQ ID NO: 31: 
CTCNACNGCN AGANCKNGTN 6CNA 



(2) INFORMTITION FOR SEQ ID NO:32: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTGCAGGGAT CCACCATGC6 GCTTTTGAC6 AG 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTGCAGGGAT CCTTATTCCA CACGAGGGAT T 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAC94ENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Asp Asn His Ser Ser Val Pro Trp Ala Ser Ala Ala Ser Val Thr 
15 10 15 
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Cys Leu Ser Leu Asp Ala Lys Cys His Ser Ser Ser Ser Ser Ser Ser 
20 25 30 

Ser Lys Ser Ala Ala Ser Ser He Ser Ala He Pro Gin Glu Glu Thr 
35 40 45 

Gin Thr Met Arg His He Ala His Thr Gin Arg Cys Leu Ser Arg Leu 
50 55 60 



10 



Thr Ser Leu Val Ala Leu Leu Leu He Val Leu Pro Met Val Phe Ser 
65 70 75 80 



15 



Pro Ala His Ser Cys Gly Pro Gly Arg Gly Leu Gly Arg His Arg Ala 
85 90 95 

Arg Asn Leu Tyr Pro Leu Val Leu Lys Gin Thr He Pro Asn Leu Ser 
100 105 no 



20 



Glu Tyr Thr Asn Ser Ala Ser Gly Pro Leu Glu Gly Val He Arg Arg 
115 120 125 



Asp Ser Pro Lys Phe Lys Asp Leu Val Pro Asn Tyr Asn Arg Asp He 
130 135 140 



25 



Leu Phe Arg Asp Glu Glu Gly Thr Gly Ala Asp Arg Leu Met Ser Lys 
145 150 155 160 



30 



Arg Cys Lys Glu Lys Leu Asn Val Leu Ala Tyr Ser Val Met Asn Glu 
165 170 175 

Trp Pro Gly He Arg Leu Leu Val Thr Glu Ser Trp Asp Glu Asp Tyr 
180 185 190 



35 



His His Gly Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Thr He 
195 200 205 



Ala Thr Ser Asp Arg Asp Gin Ser Lys Tyr Gly Met Leu Ala Arg Leu 
210 215 220 



40 



Ala Val Glu Ala Gly Phe Asp Trp Val Ser Tyr Val Ser Arg Arg His 
225 230 235 240 



45 



He Tyr Cys Ser Val Lys Ser Asp Ser Ser He Ser Ser His Val His 
245 250 255 

Gly Cys Phe Thr Pro Glu Ser Thr Ala Leu Leu Glu Ser Gly Val Arg 
260 265 270 • 



50 



Lys Pro Leu Gly Glu Leu Ser He Gly Asp Arg. Val Leu Ser Met Thr 
275 280 285 



Ala Asn Gly Gin Ala Val Tyr Ser Glu Val He Leu Phe Met Asp Arg 
290 . 295 300 



55 



Asn Leu Glu Gin Met Gin Asn Phe Val Gin Leu His Thr Asp Gly Gly 
305 310 315 320 
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Ala Val Leu Thr Val Thr Pro Ala 
325 

Glu Ser Gin Lys Leu Thr Phe Val 
340 

Asn Gin Val Leu Val Arg Asp Val 
355 360 

Arg Val Val Lys Val Gly Ser Val 
370 375 

Leu Thr Arg Glu Gly Thr He Val 
385 390 



His Leu Val Ser Val Trp Gin Pro 
330 335 

Phe Ala Asp Arg He Glu Glu Lys 
345 350 

Glu Thr Gly Glu Leu Arg Pro Gin 
365 

Arg Ser Lys Gly Val Val Ala Pro 
380 

Val Asn Ser Val Ala Ala Ser Cys 
395 400 



Tyr Ala Val He Asn Ser Gin Ser Leu Ala His Trp Gly Leu Ala Pro 

405 410 415 

Met Arg Leu Leu Ser Thr Leu Glu Ala Trp Leu Pro Ala Lys Glu Gin 
420 425 430 

Leu His Ser Ser Pro Lys Val Val Ser Ser Ala Gin Gin Gin Asn Gly 
435 440 445 

He His Trp Tyr Ala Asn Ala Ijeu Tyr Lys Val Lys Asp Tyr Val Leu 
450 455 460 

Pro Gin Ser Trp Arg His Asp 
465 470 



(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SBQUBNCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Arg Cys Lys Glu Arg Val Asn Ser Leu Ala lie Ala Val Met His Met 
15 10 15 

Trp Pro Gly Val Arg Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly 
20 25 30 

His His Leu Pro Asp Ser Leu His Tyr Glu Gly Arg Ala Leu Asp He 
35 40 45 

Thr Thr Ser Asp Arg Asp Arg His Lys Tyr Gly Met Leu Ala Arg Leu 
50 55 60 
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167 

Ala Val Glu Ala Gly Phe Asp Trp Val 
65 70 



5 (2) INFORMTITION FOR SHQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRA^IENT TYPE: internal 

15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Arg Cys Lys Asp Lys Leu Asn Ala Leu Ala He Ser Val Met Asn Gin 
1 5 10 15 

Trp Pro Gly Val Lys Leu Arg Val Thr Glu Gly Trp Asp Glu Asp Gly 
20 25 30 

His His Ser Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp He 
35 40 45 



Thr Thr Ser Asp Arg Asp Arg Ser Lys Tyr Gly Met Leu Ala Arg Leu 
30 50 55 60 



Ala Val Glu Ala Gly Phe Asp Trp Val 
65 70 



(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 
40 . (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

45 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

50 

Lys Arg Cys Lys Glu Lys Leu Asn Val Leu Ala Tyr Ser Val Met Asn 
1 5 10 15 

Glu Trp Pro Gly He Arg Leu Val Val Thr Glu Ser Trp Asp Glu Asp 
55 20 25 30 

Tyr His His Gly Gin Glu Ser Leu His Tyr Glu Gly Arg Ala Val Thr 
35 40 45 
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He Ala Thr Ser Asp Arg Asp Gin Ser Lys Tyr Gly Met Leu Ala Arg 
50 55 60 



(2) INFORMATION FOR SBQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AAAAGCTTTA YT6YTAYGTN GGNATHGG 28 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AAGAATTCTA NGCRTTRTAR TTRTTNGG 28 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Gly Pro Gly Arg Gly Xaa Gly Xaa Arg Arg His Pro Lys Lys Leu 
15 10 15 

Thr Pro Leu Ala Tyr Lys Gin Phe He Pro Asn Val Ala Glu Lys Thr 
20 25 30 
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Leu Gly Ala Ser 6ly Arg Tyr Glu 61y Lys lie Xaa Arg Asn Ser Glu 
35 40 

Arg Phe Lys Glu Leu Thr Pro Asn Tyr Asn Pro Asp lie lie Phe Lys 
50 55 60 

Asp Glu Glu Asn Thr Gly Ala Asp Arg Leu Met Thr Gin Arg Cys Lys 
€5 70 75 80 

Asp Lys Leu Asn Xaa Leu Ala lie Ser Val Met Asn Xaa Trp Pro Gly 
85 90 95 

Val Xaa Leu Arg Val Tfar Glu Gly Trp Asp Glu Asp Gly His His Xaa 
XOO 105 110 

Glu Glu Ser Leu His Tyr Glu Gly Arg Ala Val Asp lie Thr Thr Ser 
115 120 125 

Asp Arg Asp Xaa Ser Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala Val Glu 
130 135 140 

Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Lys Ala His lie His Cys 
145 150 155 160 

Ser Val Lys Ala Glu 
165 



(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Cys Gly Pro Gly Arg Gly Xaa Xaa Xaa Arg Arg Xaa Xaa Xaa Pro Lys 
1 5 10 15 

Xaa Leu Xaa Pro Leu Xsia Tyr Lys Gin Phe Xaa Pro Xaa Xaa Xaa Glu 
20 25 30 

Xaa Thr Leu Gly Ala Ser Gly Xaa Xaa Glu Gly Xaa Xaa Xaa Arg Xaa 
35 40 45 

Ser Glu Arg Phe Xaa Xaa Leu Thr Pro Asn Tyr Asn Pro Asp lie lie 
50 55 60 



Phe Lys Asp Glu Glu Asn Xaa Gly Ala Asp Arg Leu Met Thr Xaa Arg 
65 70 75 80 
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Cys Lys Xaa Xaa Xaa Asn Xaa Leu Ala lie Ser Val Met Asn Xaa Trp 
85 90 95 

Pro Gly Val Xaa Leu Arg Val Thr Glu Gly Xaa Asp Glu Asp Gly His 
100 105 110 

His Xaa Xaa Xaa Ser Leu His Tyr Glu Gly Arg Ala Xaa Asp lie Thr 
115 120 125 

Thr Ser Asp Arg Asp Xaa Xaa Lys Tyr Gly Xaa Leu Xaa Arg Leu Ala 
130 135 140 

Val Glu Ala Gly Phe Asp Trp Val Tyr Tyr Glu Ser Xaa Xaa His Xaa 
145 150 155 160 

His Xaa Ser Val Lys Xaa Xaa 
165 



(2) INFORMATION FOR SEQ ID N0:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3897 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

ATG GAC CGC GAC AGC CTC CCA CGC GTT CCG GAC ACA CAC GGC GAT GTG 
Met Asp Arg Asp Ser Leu Pro Arg Val Pro Asp Thr His Gly Asp Val 
1 5 10 15 

GTC GAT GAG AAA TTA TTC TCG GAT CTT TAG ATA CGC ACC AGC TGG GTG 
Val Asp Glu Lys Leu Phe Ser Asp Leu Tyr lie Arg Thr Ser Trp Val 
20 25 30 

GAC GCC CAA GTG GCG CTC GAT CAG ATA GAT AAG GGC AAA GCG CGT GGC 
Asp Ala Gin Val Ala Leu Asp Gin lie Asp Lys Gly Lys Ala Arg Gly 
35 40 45 

AGC CGC ACG GCG ATC TAT CTG C6A TCA GTA TTC CAG TCC CAC CTC GAA 
Ser Arg Thr Ala lie Tyr I^u Arg Ser Val Phe Gin Ser His Leu Glu 
50 55 60 

ACC CTC GGC AGC TCC GTG CAA AAG CAC GCG GGC AAG GTG CTA TTC GTG 
Thr Leu Gly Ser Ser Val Gin Lys His Ala Gly Lys Val Leu Phe Val 
65 70 75 80 
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ni 

GCT ATC CTG GTG CTG AGC ACC TTC TGC GTC GGC CTG AAG AGC GCC CAG 288 
Ala lie Leu Val Leu Ser Thr Phe Cys Val Gly Leu Lys Ser Ala Gin 
85 90 95 

5 ATC CAC TCC AAG GTG CAC CAG CTG TGG ATC CAG GAG GGC GGC GGG CTG. 336 
lie His Ser Lys Val His Gin Leu Trp lie Gin Glu Qly Gly Gly Leu 
100 105 110 

GAG GCG GAA GTG GCC TAC ACA CAG AAG ACG ATC GGC GAG'GAC GAG TCG 384 
10 Glu Ala Glu Leu Ala Tyr Thr Gin Lys Thr lie Gly Glu Asp Glu Ser 
115 120 125 

GCC ACG CAT CAG CTG CTC ATT CAG ACG ACC CAC GAC CCG.AAC GCC TCC 432 
Ala Thr His Gin Leu Leu lie Gin Thr Thr His Asp Pro Asn Ala Ser 
15 130 135 140 

GTC CTG CAT CCG CAG GCG CTG CTT GCC CAC CTG GAG GTC CTG GTC AAG 480 
Val Leu His Pro Gin Ala Leu I^eu Ala His Leu Glu Val Leu Val Lys 
145 150 155 160 

20 

GCC ACC GCC GTC AAG GTG CAC CTC TAC GAC ACC GAA TGG GGG CTG CGC 528 
Ala Thr Ala Val Lys Val His Leu Tyr Asp Thr Glu Trp Gly Leu Arg 
165 170 175 

25 GAC ATG TGC AAC ATG CCG AGC ACG CCC TCC TTC GAG GGC ATC TAC TAC 576 
Asp Met Cys Asn Met Pro Ser Thr Pro Ser Phe Glu Gly He Tyr Tyr 
180 185 190 

ATC GAG CAG ATC CTG CGC CAC CTC ATT CCG TGC TCG ATC ATC ACG CCG 624 
30 He Glu Gin He Leu Arg His Leu He Pro Cys Ser He He Thr Pro 
. 195 200 205 

CTG GAC TGT TTC TGG GAG GGA AGC CAG CTG TTG GGT CCG GAA TCA GCG 672 
Leu Asp Cys Phe Trp Glu Gly Ser Gin Leu Leu Gly Pro Glu Ser Ala 
35 210 215 220 

GTC GTT ATA CCA GGC CTC AAC CAA CGA CTC CTG TGG ACC ACA CTG AAT 720 
Val Val He Pro Gly Leu Asn Gin Arg Leu Leu Trp Thr Thr Leu Asn 
225 230 235 240 



40 



CCC GCC TCT GTG ATG CAG TAT ATG AAG CAG AAG ATG TCC GAG GAA AAG 768 
Pro Ala Ser Val Met Gin Tyr Met Lys Gin Lys Met Ser Glu Glu Lys 
245 250 255 



45 ATC AGC TTC GAC TTC GAG ACC GTG GAG CAG TAC ATG AAG CGT GCG GCC 816 
He Ser Phe Asp Phe Glu Thr Val Glu Gin Tyr Met Lys Axg Ala Ala 
260 265 270 

ATT GCG AGT GGC TAC ATG GAG AAG CCC TGC CTG AAC CCA CTG AAT CCC 864 
50 He Ala Ser Gly Tyr Met Glu Lys Pro Cys Leu Asn Pro Leu Asn Pro 
275 280 285 



55 



AAT TGC CCG GAC ACG GCA CCG AAC AAG AAC AGC ACC CAG CCG 
Asn Cys Pro Asp Thr Ala Pro Asn Lys Asn Ser Thr Gin Pro 
290 295 300 



CCG GAT 
Pro Asp 



912 
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ni 

GTG GGA GCC ATC CTG TCC GGA GGC TGC TAC GGT TAT GCC GCG AAG CAC 960 
Val Gly Ala lie Leu Ser Gly Gly Cys Tyr Gly Tyr Ala Ala Lye His 
305 310 315 320 

ATG CAC TG6 CCG GAG GAG CTG ATT GTG GGC GGA GCG AAG AGG AAC CGC 1008 
Met His Trp Pro Glu Glu Leu lie Val Gly Gly Ala Lys Arg Asn Arg 
325 330 335 

AGC GGA CAC TTG AGG AAG GCC GAG GCC CTG CAG TCG GTG GTG CAG CTG 1056 
Ser Gly His Leu Arg Lys Ala Gin Ala Leu Gin Ser Val Val Gin Leu 
340 345 350 

ATG ACC GAG AAG GAA ATG TAC GAC CAG TGG CAG GAC AAC TAC AAG GTG 1104 
Met Thr Glu Lys Glu Met Tyr Asp Gin Trp Gin Asp Asn Tyr Lys Val 
355 360 365 

CAC CAT CTT GGA TGG ACG CAG GAG AAG GCA GCG GAG GTT TTG AAC GCC 1152 
His His Leu Gly Trp Thr Gin Glu Lys Ala Ala Glu Val Leu Asn Ala 
370 375 380 

TGG CAG CGC AAC TTT TCG CGG GAG GTG GAA CAG CTG CTA CGT AAA CAG 1200 
Trp Gin Arg Asn Phe Ser Arg Glu Val Glu Gin Leu Leu Arg Lys Gin 
385 390 395 400 

TCG AGA ATT GCC ACC AAC TAC GAT ATC TAC GTG TTC AGC TCG GCT GCA 1248 
Ser Arg lie Ala Thr Asn Tyr Asp lie Tyr Val Phe Ser Ser Ala Ala 
405 410 415 

CTG GAT GAC ATC CTG GCC AAG TTC TCC CAT CCC AGC GCC TTG TCC ATT 1296 
Leu Asp Asp lie Leu Ala Lys Phe Ser His Pro Ser Ala Leu Ser He 
420 425 430 

GTC ATC GGC GTG GCC GTC ACC GTT TTG TAT GCC TTC TGC ACG CTC CTC 1344 
Val He Gly Val Ala Val Thr Val Leu Tyr Ala Phe Cys Thr Leu Leu 
435 440 445 

CGC TGG AGG GAC CCC GTC CGT GGA CAG AGC AGT GTC GGC GTG GCC GGA 1392 
Arg Trp Arg Asp Pro Val Arg Gly Gin Ser Ser Val Gly Val Ala Gly 
450 455 460 

GTT CTG CTC ATG TGC TTT AGT ACC GCC GCC GGA TTG GGA TTG TCA GCC 1440 
Val Leu Leu Met Cys Phe Ser Thr Ala Ala Gly Leu Gly Leu Ser Ala 
465 470 475 480 

CTG CTC GGT ATC GTT TTC AAT GCC GCC AGC ACC CAG GTG GTT CCG TTT 1486 
Leu Leu Gly He Val Phe Asn Ala Ala Ser Thr Gin Val Val Pro Phe 
485 490 495 

TTG GCC CTT GGT CTG GGC GTC GAT CAC ATC TTC ATG CTG ACC GCT GCC 1536 
Leu Ala Leu Gly Leu Gly Val Asp His He Phe Met Leu Thr Ala Ala 
500 505 510 

TAT GCG GAG AGC AAT CGG CGG GAG CAG ACC AAG CTG ATT CTC AAG AAA 1584 
Tyr Ala Glu Ser Asn Arg Arg Glu Gin Thr Lys Leu He Leu Lys Lys 
515 520 525 
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GTG GGA CCG AGC ATC CTG TTC AGT GCC TGC AGC ACC GCA GGA TCC TTC 1632 
Val Gly Pro Ser lie Leu Phe Ser Ala Cys Ser Thr Ala Gly Ser Phe 
530 535 540 

TTT GCG GCC GCC TTT ATT CCG GTG CCG GCT TTG AAG GTA TTC TGT CTG 1680 
Phe Ala Ala Ala Phe lie Pro Val Pro Ala Leu Lys Val Phe Cys Leu 
545 550 555 560 

CAG GCT GCC ATC GTA ATG TGC TCC AAT TTG GCA GCG GCT CTA TTG GTT 1728 
Gin Ala Ala lie Val Met Cys Ser Asn Leu Ala Ala Ala Leu Leu Val 
565 570 575 

TTT CCG GCC ATG ATT TCG TTG GAT CTA CGG AGA CGT ACC jSCC GGC AGG 1776 
Phe Pro Ala Met lie Ser Leu Asp Leu Arg Arg Arg Thr Ala Gly Arg 
580 585 590 

GCG GAC ATC TTC TGC TGC TGT TTT CCG GTG TGG AAG GAA CAG CCG AAG 1824 
Ala Asp lie Phe Cys Cys Cys Phe Pro Val Trp Lys Glu Gin Pro Lys 
595 600 605 

GTG GCA CCA CCG GTG CTG CCG CTG AAC AAC AAC AAC GGG CGC GGG GCC 1872 
Val Ala Pro Pro Val Leu Pro Leu Asn Asn Asn Asn Gly Arg Gly Ala 
610 615 620 

CGG CAT CCG AAG AGC TGC AAC AAC AAC AGG GTG GCG CTG CCC GCC CAG 1920 
Arg His Pro Lys Ser Cys Asn Asn Asn Arg Val Ala Leu Pro Ala Gin 
625 630 635 640 

AAT CCT CTG CTG GAA CAG AGG GCA GAC ATC CCT GGG AGC AGT CAC TCA 1968 
Asn Pro Leu Leu Glu Gin Arg Ala Asp He Pro Gly Ser Ser His Ser 
645 650 €55 

CTG GCG TCC TTC TCT CTG GCA ACA TTC GCC TTT CAG CAC TAC ACT CCC 2016 
Leu Ala Ser Phe Ser Leu Ala Thr Phe Ala Phe Gin His Tyr Thr Pro 
660 665 670 

TTC CTC ATG CGC AGC TGG GTG AAG TTC CTG ACC GTT ATG GGT TTC CTG 2064 
Phe Leu Met Arg Ser Trp Val Lys Phe Leu Thr Val Met Gly Phe Leu 
675 680 685 

GCG GCC CTC ATA TCC AGC TTG TAT GCC TCC ACG CGC CTT CAG GAT GGC 2112 
Ala Ala Leu He Ser Ser Leu Tyr Ala Ser Thr Arg Leu Gin Asp Gly 
690 695 700 

CTG GAC ATT ATT GAT CTG GTG CCC AAG GAC AGC AAC GAG CAC AAG TTC 2160 
Leu Asp He He Asp Leu Val Pro Lys Asp Ser Asn Glu His Lys Phe 
705 710 715 720 

CTG GAT GCT CAA ACT CGG CTC TTT GGC TTC TAC AGC ATG TAT GCG GTT 2208 
Leu Asp Ala Gin Thr Arg Leu Phe Gly Phe Tyr Ser Met Tyr Ala Val 
725 730 735 

ACC CAG GGC AAC TTT GAA TAT CCC ACC CAG CAG CAG TTG CTC AGG GAC 2256 
Thr Gin Gly Asn Phe Glu Tyr Pro Thr Gin Gin Gin Leu Leu Arg Asp 
740 745 750 
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TAC CAT GAT TCC TTT GTG CGG GTG CCA CAT GTG ATC AAG AAT GAT AAT 2304 
Tyr His Asp Ser Phe Val Arg Val Pro His Val He Lys Asn Asp Asn 
755 760 765 



GGT GGA CTG CCG GAC TTC TGG CTG CTG CTC TTC AGC GAG TGG CTG GGT 2352 
Gly Gly Leu . Pro Asp Phe Tsp Leu Leu Leu Phe Ser Glu Trp Leu Gly 
770 775 780 

AAT CTG CAA AAG ATA TTC GAC GAG GAA TAC CGC GAC GGA CGG CTG ACC 2400 
Asn Leu Gin Lys He Phe Asp Glu Glu Tyr Arg Asp Gly Arg Leu Thr 
785 790 795 800 

AAG GAG TGC TGG TTC CCA AAC GCC AGC AGC GAT GCC ATC CTG GCC TAC 2448 
Lys Glu Cys Trp Phe Pro Asn Ala Ser Ser Asp Ala He Leu Ala Tyr 
805 810 815 



AAG CTA ATC GTG CAA ACC GGC CAT GTG GAC AAC CCC GTG GAC AAG GAA 2496 
Lys Leu He Val Gin Thr Gly His Val Asp Asn Pro Val Asp Lys Glu 
820 825 830 

CTG GTG CTC ACC AAT CGC CTG GTC AAC AGC GAT GGC ATC ATC AAC CAA 2544 
Leu Val Leu Thr Asn Arg Leu Val Asn Ser Asp Gly He He Asn Gin 
835 840 845 

CGC GCC TTC TAC AAC TAT CTG TCG GCA TGG GCC ACC AAC GCG TCT TCG 2592 
Arg Ala Phe Tyr Asn Tyr Leu Ser Ala Trp Ala Thr Asn Ala Ser Ser 
850 855 860 

CCT ACG GAG CTT CTC AGG GCA AAT TGT ATC CGG AAC CGC GCC AAC GGA 2640 
Pro Thr Glu Leu Leu Arg Ala Asn Cys He Arg Asn Arg Ala Asn Gly 
865 870 875 880 



GCT TCT GAG GGC AAA TTG TAT CCG GAA CCG CGC CAG TAT TTT CAC CAA 2688 
Ala Ser Gin Gly Lys Leu Tyr Pro Glu Pro Arg Gin Tyr Phe His Gin 
885 890 895 

CCC AAC GAG TAC GAT CTT AAG ATA CCC AAG AGT CTG CCA TTG GTC TAC 2736 
Pro Asn Glu Tyr Asp Leu Lys He Pro Lys Ser Leu Pro Leu Val Tyr 
900 905 910 

GCT CAG ATG CCC TTT TAC CTC CAC GGA CTA ACA GAT ACC TCG CAG ATC 2784 
Ala Gin Met Pro Phe Tyr Leu His Gly Leu Thr Asp Thr Ser Gin He 
915 9'20 925 

AAG ACC CTG ATA GGT CAT ATT CGC GAC CTG AGC GTC AAG TAC GAG GGC 2632 
Lys Thr Leu He Gly His He Arg Asp Leu Ser Val Lys Tyr Glu Gly 
930 935 940 

TTC GGC CTG CCC AAC TAT CCA TCG GGC ATT CCC TTC ATC TTC TGG GAG 2B80 
Phe Gly Leu Pro Asn Tyr Pro Ser Gly He Pro Phe He Phe Trp Glu 
945 950 955 960 



CAG TAC ATG ACC CTG CGC TCC TCA CTG GCC ATG ATC CTG GCC TGC GTG 
Gin Tyr Met Thr Leu Arg Ser Ser Leu Ala Met He Leu Ala Cys Val 
965 970 975 



2928 
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CTA CTC GCC GGC CTG GTG CTG GTC TCC CTG CTC CTG CTC TCC GTT TGG 2976 
Leu Leu Ala Ala Leu Val Leu Val Ser Leu Leu Leu Leu Ser Val Trp 
980 965 990 

GCC GCC GTT CTC GTG ATC CTC AGC GTT CTG GCC TCG CTG GCC CAG ATC 3024 
Ala Ala Val Leu Val lie Leu Ser Val Leu Ala Ser Leu Ala' Gin lie 
995 1000 1005 

TTT GOG GCC AT6 ACT CTG CTG GGC ATC AAA CTC TCG GCC ATT CCG GCA 3072 
Phe Gly Ala Met Thr Leu Leu Gly lie Lys Leu Ser Ala He Pro Ala 
1010 1015 1020 

GTC ATA CTC ATC CTC AGC GTG GGC ATG ATG CTG TGC TTC AAT GTG CTG 3120 
Val He Leu He Leu Ser Val Gly Met Met Leu Cys Phe Asn Val Leu 
1025 103O 1035 1040 

ATA TCA CTG GGC TTC ATG ACA TCC GTT GGC AAC CGA CAG CGC CGC GTC 3168 
He Ser Leu Gly Phe Met Thr Ser Val Gly Asn Arg Gin Arg Arg Val 
1045 1050 1055 

CAG CTG AGC ATG CAG ATG TCC CTG 6GA CCA CTT GTC CAC GGC ATG CTG 3216 
Gin Leu Ser Met Gin Met Ser Leu Gly Pro Leu Val His Gly Met Leu 
1060 1065 1070 

ACC TCC GGA GTG GCC GTG TTC ATG CTC TCC ACG TCG CCC TTT GAG TTT 3264 
Thr Ser Gly Val Ala Val Phe Met Leu Ser Thr Ser Pro Phe Glu Phe 
1075 1080 1085 

GTG ATC CGG CAC TTC TGC TOG CTT CTG CTG GTG GTC TTA TOC GTT GGC 3312 
Val He Arg His Phe Cys Trp Leu Leu X«eu Val Val Leu Cys Val Gly 
1090 1095 1100 

GCC TGC AAC AGC CTT TTG GTG TTC CCC ATC CTA CTG AGC ATG GTG GGA 3360 
Ala Cys Asn Ser Leu Leu Val Phe Pro He Leu Leu Ser Met Val Gly 
1105 1110 1115 1120 

CCG GAG GCG GAG CTG GTG CCG CTG GAG CAT CCA GAC CGC ATA TCC ACG 3408 
Pro Glu Ala Glu Leu Val Pro Leu Glu His Pro Asp Arg He Ser Thr 
1125 1130 1135 

CCC TCT CCG CTG CCC GTG CGC AGC AGC AAG A6A TCG GGC AAA TCC TAT 3456 
Pro Ser Pro Leu Pro Val Arg Ser Ser Lys Arg Ser Gly Lys Ser Tyr 
1140 1145 1150 

GTG GTG CAG GGA TCG CGA TCC TCG CGA GGC AGC TGC CAG AAG TCG CAT 3504 
Val Val Gin Gly Ser Arg Ser Ser Arg Gly Ser Cys Gin Lys Ser His 
1155 1160 1165 

CAC CAC CAC CAC AAA GAC CTT AAT GAT CCA TCG CTG ACG ACG ATC ACC 3552 
His His His His Lys Asp Leu Asn Asp Pro Ser Leu Thr Thr He Thr 

1170 1175 1180 

GAG GAG CCG CAG TCG TGG AAG TCC AGC AAC TCG TCC ATC CAG ATG CCC 3600 
Glu Glu Pro Gin Ser Trp Lys Ser Ser Asn Ser Ser He Gin Met Pro 
1185 1190 1195 1200 
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AAT GAT TGG ACC TAG GAG CCX3 CGG GAA CAG CGA CCC GCC TCC TAG GCG 
Asn Asp Trp Thr Tyr Gin Pro Arg Glu Gin Arg Pro Ala Ser Tyr Ala 
1205 1210 1215 

5 GCC CGG CCC CCC GCC TAT GAG AAG GCC GCC GCC CAG CAG GAG GAG CAG 
Ala Pro Pro Pro Ala Tyr His Lys Ala Ala Ala Gin Gin His His Gin 
1220 1225 1230 

CAT CAG GGC CGG GCC ACA AGG CCC CGG GCG GCC TTC CGG ACG GCC TAT 
10 His Gin Gly Pro Pro Thr Thr Pro Pro Pro Pro Phe Pro Thr Ala Tyr 
1235 1240 1245 



3648 



3696 



3744 



15 



CGG CGG GAG CTG CAG AGG ATG GTG GTG CAG GCG GAG GTG ACG GTG GAG 
Pro Pro Glu Leu Gin Ser lie Val Val Gin Pro Glu Val Thr Val Glu 
1250 1255 1260 



3792 



20 



25 



AGG AGG GAG TGG GAC AGG AAG ACC ACC AAG GTG AGG GCG AGG GCC AAG 3840 
Thr Thr His Ser Asp Ser Asn Thr Thr Lys Val Thr Ala Thr Ala Asn 
1265 1270 1275 1280 

ATG AAG GTG GAG CTG GGC ATG CCC GGC AGG GCG GTG CGG AGG TAT AAG 3888 
lie Lys Val Glu Leu Ala Met Pro Gly Arg Ala Val Arg Ser Tyr Asn 
1285 1290 . 1295 

TTT AGG AGT TAG 3900 
Phe Thr Ser 



30 



35 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACCGAGGGGT GGGACGAAGA TGGC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



24 



55 



(ii) MOLECULE TYPE: cDNft 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CGCTCGGTCG TACGGCATGA ACGAC 25 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGGGGATGT GTGTGTGGTC AAGTQTA 27 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TTCACAGACT CTCAAAGTGT ATTTT 25 
(2) INFORMATION FOR SEQ ID N0:47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: interxial 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gly Ser Ser His His His His His His Leu Val Pr Arg Gly Ser 
15 10 15 

His Met 
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What is clfliined isr 

1. A substantially pure preparation of a hedgehog-mae protein comprising a polypeptide 
having an amino acid sequence identical or homologous to a vertebrate hedgehog 
protein and not identical to SEQ ID No. 34. 

2. The hedgehog-likt protein of claim 1, which protein is a Sonic hedgehog protein, 
conpising a polypeptide having an amino acid sequence identical or homologous with 
a polypeptide represented in SEQ ID No. 1, SEQ ID No. 4, SEQ ID No. 5 or SEQ ID 
No. 6. 

3. The hedgehogAiks protein of claim 2, v/idch protein comprises a polypeptide 
rqjresented by SEQ ID No. 40. 

4. The hedgehog-Ukc protein of claim 1, which protein is an Indian hedgehog protein, 
comprising a polypeptide having an amino acid sequence identical or homologous with 
a polypeptide represented in SEQ ID No. 3 or SEQ ID No. 7. 

5. The hedgehog-mss protein of claim 1, which protein is a Desert hedgehog protein, 
comprising a polypeptide having an amino acid sequence identical or homologous with 
the polypeptide represented in SEQ ID No. 2. 

6. The hedgehog-hkt protein of claim 1, which protein comprises a polypeptide 
represented by SEQ ID No. 41. 

7. The hedgehog-Vkt protein of claim 1, vMxih. protein modulates at least one of 
proliferation, differentiation or survival of mesodermal or ectodemial cells. 

8. The hedgelwg')SkR protein of claim 7, i^iiich protein has a biological activity selected 
from the groi^ consisting of a neuronal differentiation inducing activity, a neuronal 
survival promoting activity, a limb pattemmg activity, a stem cell/progenitor cell 
inducing activity, a geim cell inducing activity, and a oomtnnation thereof 

9. The hedgehogASke protein of claim 7, which protein agonizes at least one of 
proliferation, differentiation or survival of mesodermal or ectodermal cells. 

10. The hedgehog-Wa&s protein of claim 7, which protein antagonizes at least one of 
proliferation, differentiation or survival of mesodermal or ectodermal cells. 
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1 1 . The hedgehog'X^z protein of claim 1 • which protein is glycosylated. 

12. The hedgehoZ'X^t protein of claim 1 , which protein comprises an N-terminal portion of 
a vertebrate hedgehog protein having an approximate molecular weight of 1 9kd. 

1 3 . The hedgehog-XikR protein of claim 1 , which protein comprises an C-terminal portion of 
a vertebrate hedgehog protem having an approximate molecular weight of 27kd. 

14. An immunogen comprising a portion of the protein of claim 1^ said inmiunogen being 
antigenically distinct from any portion of the protein of SEQ ID No:34. 

15. An antibody preparation specifically reactive with an epitope of the hedgehog-^t 
protein of claim 1. 

16. A recombinant protein comprising a hedgehog polypeptide sequence identical or 
homologous to a vertebrate hedgehog protein, or a portion thereof, which recombinant 
protein functions in one of either role of an agonist of at least one biological activity of 
said vertebrate hedgehog protein or an antagonist of at least one biological activity of 
said vertebrate hedgehog protein, said hedgehog polypeptide sequence distinct from 
SEQroNo:34. 

17. The recombinant protein of claim 16, which hedgehog amino acid sequence is at least 
60 percent homologous with an amino acid sequence of a vertebrate hedgehog protein 
represented in the group consisting SEQ ID No:8, SEQ ID No:9, SEQ ID No: 10, SEQ 
ID No:l L SEQ ID No:12, SEQ ID No:13 and SEQ ID No:14. 

1 8. The recombinant protein of claim 16, which hedgehog amino acid sequence comprises 
an N-terminal portion of a vertebrate hedgehog protein corresponding approximately to 
Cys-24 through Glu-1 88 of SEQ ID No. 13. 

19. The recombinant protein of claim 16, which hedgehog anuno acid sequence comprises 
a C-terminal portion of a vertebrate hedgehog protein corresponding approximately to 
Asn-1 89 through Ala^75 of SEQ ID No. 13. 

20. The recombinant protein of claim 16, which protein is a fusion protein further 
comprising, in addition to said hedgehog polypeptide sequence, a second polypeptide 
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sequence having an amino acid sequence different than said vertebrate hedgehog 
protein. 

21. The recombinant protein of claim 20, vsiierein said second polypeptide is a detectable 
label for detectmg the presence of said fusion protein or as a matrix-binding domain for 
immobilizing said fusion protein. 

22. A lecombinant hedgehog polypeptide comprising an amino acid sequence at least 70 
percent homology with a hedgehog amino acid sequence selected from a group 
consisting of residues 27-425 of SEQ ID No:8, residues 22-396 of SEQ ID No:9, 
residues 1-336 of SEQ ID No:10, residues 25-437 of SEQ ID No:l 1, residues 24-418 of 
SEQ ID No:12, residues 24-475 of SEQ ID No:13, and residues 1-312 of SEQ ID 
No;14. 

23. An isolated or recombinant hedgehog polypeptide having an amino add sequence 
crossreactive with an antibody vt^ch specifically binds a hedgehog protein having an 
amino acid sequence selected from the group consisting of SEQ ID No:8, SEQ ID 
No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13 and SEQ ID 
No:14. 

24. An isolated or recombinant hedgehog polypeptide encoded by a nucleic acid M*ich 
hybridizes under stringent conditions to a nucleic acid sequence selected from the 
group consisting of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ 
IDNo:5,SEQIDNo:6and SEQIDNo:7. 

25. An isolated nucleic acid encoding a hedgehog polypeptide having an amino add 
sequence identical or homologous to a vertebrate hedgehog protein, or a portion 
thereof, and not identical to SEQ ID No. 34. 

26. The nucldc acid of claim 25, which nucldc acid encodes a hedgehog polypeptide that 
modulates at least one of proliferation, differentiation or survival of mesodermal or 
ectodermal cells. 

27. The nucleic acid of claim 25, \Mcb nucleic acid hybridizes under stringent conditions 
to a nucleic acid probe having a sequence represented by at least 12 consecutive 
nucleotides of a hedgehog nucldc acid sequence selected from tiie group consisting of 
SEQ ID No: 1, SEQ ID No:2, SEQ ID No:3. SEQ ID No;4, SEQ ID No:5, SEQ ID No:6 
and S£QIDNo:7. 



wo 95/18856 PCTAJS94/14992 



I2t 

28. The nucleic acid of claim 25, further comprising a transcriptional regulatory sequence 
operably linked to said nucleotide sequence so as to render said nucleic acid suitable for 
use as an expression vector. 

5 

29. An expression vector, csq)able of replicating in at least one of a prokaryotic cell and 
eukaryotic cell, comprising the nucleic acid of claim 25 . 

30. A host cell transfected ivith the expression vector of claim 29 and expressing said 
1 0 hedgehog polypeptide. 

3L A method of producing a recombinant hedgehog polypeptide comprising culturing the 
cell of claim 30 in a cell culture medium to express said hedgehog polypeptide and 
isolating said hedgehog polypeptide from said cell culture. 



15 



32. A transgenic animal having cells which harbor a transgene comprising the nucleic acid 
of claim 25. 



33. A recombinant transfection system, comprising 

20 (i) a gene construct including the nucleic acid of claim 25 and operably linked to a 

transcriptional regulatory sequence for causing e)q}ression of said hedgehog 
polypeptide in eukaryotic cells, and 
(ii) a gene delivery composition for delivering said gene construct to a cell and 
causing the cell to be transfected with said gene construct. 

25 

34. The recombinant transfection system of claim 33, wherein the gene delivery 
composition is selected from a group consistmg of a recombinant viral particle, a 
liposome, and a poly-cationic nucleic acid binding agent, 

30 35. A transgenic vertebrate animal in which hedgehog inductive pathways are inhibited m 
one or more tissue of said ammal by one of either expression of an antagonistic 
hedgehog polypeptide or disruption of a hedgehog gene. 

36. A probe/primer comprising a substantially purified oligonucleotide, said 
35 oUgonucleotide contaming a region of nucleotide sequence which hybridizes under 

stringent conditions to at least 15 consecutive nucleotides of sense or antisense 
sequence of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, 
SEQ ID No:6, SEQ ID No:7, or naturally occurring mutants thereof. 
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37. The probe/primer of claim 36, which probe/primer fiirther comprises a label group 
attached thoeto and able to be detected. 

5 38. A test kit for detecting cells v»*ich contain a vertebrate hedgehog mRNA transcript, 
comprising a probe/primer of claim 36 for measuring, in a sample of cells, a level of 
nucleic acid encoding a vertebrate Aedfge/io^ protein. 

39. A method of determining if a subject is at risk for a disorder characterized by 

10 unwanted cell differentiation, dedififerentiation or proliferation, comprising detecting, 

in a sample of tissue of said subject, the presence or absence of a genetic lesion 
diaracterized by at least one of 

a mutation of a gene encoding a hedgAog protein, or a homolog thereof; and 
the mis-e>qiression of said gene. 

15 

40. The method of daim 39, wherein detecting said genetic lesion comprises 
ascercaining the existence of at least one of 

i. a deletion of one or more nucleotides from said gene, 

ii. an addition of one or more nucleotides to said gene, 
20 iii. an substitution of one or more nucleotides of said gene, 

iv. a gross chromosomal rearrangement of said gene. 
V. a gross alteration in the level of a messenger RNA transcript of said 
gene, 

vi. the presence of a non-wild type splicing pattern of a messenger RNA 
25 transcript of said gene, and 

vii. a non*wild type level of said protein. 

41. The method of claim 39, wherein deteaing said genetic lesion comprises 
i. providing a probe/primer comprising an oligonucleotide containing a 

region of nucleotide sequence which hybridizes to a sense or antisense 
sequence of one or more of SEQ m No:l, SEQ ID No:2, SEQ ID No:3, SEQ 
ID No:4, SEQ ID No:5, SEQ ID No:6, SEQ ID No:7, or natundly ocolrring 
mutants thereof or 5' or 3' flanking sequences naturally associated with said 
gene; 

35 ii. ejcposing said probe/ primer to nucleic acid of said tissue; and 

iii. detecting, by hybridization of said probe/primer to said nucleic acid, tiie 
presence or absence of said genetic lesion. 



30 
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42. The method of claim 39, wherein detecting said lesion comprises utilizing said 
probe/primer to detennine the nucleotide sequence of said gene and, optionally, of 
said flanking nucleic acid sequences. 

43. The method of claim 39, >^iierem detecting said lesion comprises utilizing said 
probe/primo- to in a polymerase chain reaction (PGR). 

44. The method of claim 39, wherem detecting said lesion comprises utjlizing said 
probe/primer in a ligation diain reaction (LCR). 

45. The method of claim 40, viierein the level of said protein is detected in an 
immunoass^. 

46. A composition suitable for phaimaceutical administration con^nising 

(i) at least one polypeptide capable of fimctioning in one of either lole of an 
ag(»ust of at least one biological activity of a vertebrate hedgehog 
protein or an antagonist of at least one biological activity of said 
vertebrate Aei/geAog protein; and 

(ii) a phannaceutically accqitable carrier. 

47. The pharmaceutical composition of daim 46, wherem said polypeptide conqnises an 
amino acid sequence selected fiom the group consisting of SEQ ID No:8, SEQ ID 
No:9, SEQ ID No.lO, SEQ ID No:l 1, SEQ ID No:12. SEQ ID No:13, SEQ ID No:14, 
SEQ ID No. 34, SEQ ID No. 40 and SEQ ID No. 41. 

48. A method for inducing a cell to differentiate to a neuronal cell phenolype, comprising 
contacting said cell with a jnotein of claim 1 . 

49. A method for modulating, m an animal, cell growth, cell differentiation or cell survival, 
comprising administering a flierapeutically effective amount of a hedgehog polypeptide. 

50. A metiiod for treating a degenerative disorder of the nervous system characterized by 
n«ironal cell death, coh^rising administering to a patient a therapeutically effective 
amoimt of a pharmaceutical preparation of a hedgehog polypeptide comprising and 
anuno acid sequence selected from tiie group consisting of SEQ ID No:8, SEQ ID 
No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, 
SEQ ID No. 34, SEQ ID No. 40, SEQ ID No. 41 or homologs tiiereof. 
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51. The method of claim 50, wherein said therapeutically effective amount of said 
hedgehog polypeptide inhibits the de-differentiation of neuronal cells of said patient 

52. The method of claim 5 1 , v^erein said neural cell phenotype is a glial cell. 

53. The method of claim 1 >;^erein said neural cell phenotype is a nerve cell. 

54. The me&od of claim 50 wherein said degenerative disorder is a neuromuscular disorder. 

55. The method of claim 50, wherein said degenerative disorder is a autonomic disorder. 

56. The method of claim 50, herein said degenerative disorder is a central nervous system 
disorder. 

57. The method of claim 50, wherein said degenerative disorder is selected fiom a group 
consisting of Akdieimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, 
Pick's disease, Huntington's disease, multiple sclerosis, neuronal damage resulting fiom 
anoxia^ischemia, neuronal damage resulting fiom trauma, and neuronal degeneration 
associated with a natural aging process. 

58. The method of claim 50, further comprising administering a tiien^utically effective 
amount of a growth fSstctor having neurotrophic activity is adminitrtgnp^ tn cfl^H p?»rii»yi t 

59. The metiiod of claim 58, wherein said growth factor is selected fiom a group consisting 
of a nerve growth factor, cilliary neurotrophic growth fector, schwanoma-dOTved 
growth factor, glial growth &ctor, striatal-derived neuronotrophic £actor, platelet- 
derived growth factor. 

60. A peptidomimetic of a portion of a vertebrate hedgehog protein, which peptidomimetic 
modulates at least a portion of the biological activities of said vertebrate hedgehog 
protein. 

61. A differentiated cell generated in cell culture by contacting an undifferentiated cell fiom 
a vertebrate organism with a hedgehog polypeptide. 

62. The differentiated cell of claim 61 , which differendated cell is a neuronal cell. 

63. The differentiated cell of claim 62, which neuronal cell is a dopaminergic cell. 
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